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AMENDMENTS TO THE CLAIMS 



This listing of the claims replaces all prior listings and versions: 



1. (canceled). ^ . 4 3g OB-^ 



2. (withdrawn and currently amended): An expression cassette comprising 
a polynucleotide comprising a sequence encoding an immunogenic Env polypeptide 
and h aving at least 90% percent identity to the full-length of the sequence shown in SEQ ID 
NQftfot 



3. (withdrawn and currently amended): The expression cassette of claim 2, 
comprising 

a polynucleotide comprising a sequence encoding an immunogenic Env polypeptide 
and having at least 90% percent identity to the full-length of the sequence shown in SEQ ID 
NO:47. 



^/W 4. (withdrawn and currently amended): The expression cassette of claim 3, 

comprising 

a polynucleotide comprising a sequence encoding an immunogenic Env polypeptide 
and h aving at least 90% percent identity to the full-length of the sequence shown in SEQ ID 
NO:49 or SEQ ED NO:97. . \ % A^Oft 

5 to 37. (canceled). nSi*)' 

/ (38\ (currently amended): An expression cassette comprising "7*7 C nj • t "" 

a polynucleotide comprising a sequence encoding an immunogenic Env polypeptide 
' and h avina at least 90% percent identityvt o the full-length sequence of the sequence shown in 
SEQ ID NOl 19>. SEP ID Nq^HsEO ID NG(-l2) SEQ ID Nq@; SEQ ID NO(l23-) 
SEQ ID NC@>, SEQ ID NOU2$; SEQ ID NO(2fc; SEQ ID NO:© SEQ ID NO^ST^ 
SEQ ID NQjfgbr SEQ ID Nqtf33} 




*** 



19 to 77. (canceled). 
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78. (withdrawn): A method of immunization of a subject, comprising, 
introducing an expression cassette of claim 38 into said subject under conditions that 

are compatible with expression of said expression cassette in said subject. 

79. (withdrawn): The method of claim 78, wherein said expression cassette is 
introduced using a gene delivery vector. 

80. (withdrawn): The method of claim 79, wherein the gene delivery vector is a non- 
viral vector. 

81. (withdrawn): The method of claim 79, wherein said gene delivery vector is a 
viral vector. 

82. (withdrawn): The method of claim 79, wherein said gene delivery vector is 
selected from the group consisting of an adenoviral vector, a vaccinia viral vector, an AAV 
vector, a retroviral vector, a lentiviral vector and an alphaviral vector. 

83. (withdrawn): The method of claim 82, wherein said gene delivery vector is a 
Sindbis-virus derived vector. 

84. (withdrawn): The method of claim 82, wherein said gene delivery vector is a 
cDNA vector. 

85. (withdrawn): The method of claim 82, wherein said gene delivery vector is a 
eukaryotic layered viral initiation system (ELVIS). 

86. (withdrawn): The method of claim 79, wherein said composition delivered using 
a particulate carrier. 

87. (withdrawn): The method of claim 79, wherein said composition is coated on a 
gold or tungsten particle and said coated particle is delivered to said subject using a gene 
gun. 
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88. (withdrawn): The method of claim 79, wherein said composition is encapsulated 
in a liposome preparation. 

. 89. (withdrawn): The method of claim 79, wherein said subject is a mammal. 

90. (withdrawn): The method of claim 89, wherein said mammal is a human. 

91. (withdrawn): A method of generating an immune response in a subject, 
comprising: 

providing an expression cassette of claim 38, 
expressing said polypeptide in a suitable host cell, 
isolating said polypeptide, and 

administering said polypeptide to the subject in an amount sufficient to elicit an 
immune response. 

92. (withdrawn): A method of generating an immune response in a subject, 
comprising 

introducing into cells of said subject an expression cassette of claim 38, under 
conditions that permit the expression of said polynucleotide and production of said 
polypeptide, thereby eliciting an immunological response to said polypeptide. 

93. (withdrawn): The method of claim 92, where the method further comprises co- 
administration of an HIV polypeptide. 

94. (withdrawn): The method of claim 93, wherein co-administration of the 
polypeptide to the subject is carried out before introducing said expression cassette. 

95. (withdrawn): The method of claim 93, wherein co-administration of the 
polypeptide to the subject is carried out concurrently with introducing said expression 
cassette. 

96. (withdrawn): The method of claim 93, wherein co-administration of the 
polypeptide to the subject is carried out after introducing said expression cassette. 

97. (canceled). 
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Here is the list of the requested sequences: 



<210> SEQ ID NO 120 
<211> LENGTH: 1986 
<212> TYPE: DNA :; 

<213> ORGANISM: Artificial Sequence 
< 2 2 0 > FEATURE : ' 
<223> OTHER INFORMATION: Description of Artificial Sequence: gpi'4 Omod . TVl . d 
<400> SEQUENCE: 120 

gaattcatgc/'gcgtgatggg 



ctgggcttct^tggatgctgat 
ggcgtgcccgj-tgtggcgcga 
tacgagaccg^aggtgcacaa 
ccccaggaga.^tcgtgctggg 
gccgaccaga^gcacgagga 
aagctgaccc^ccctgtgcgt 
accgtgacccj^gcaacagcac 
atgaagaact^gcagcttcaa 
atcacccagg^cctgccccaa 
gccggctacg|!pcatcctgaa 
aacgtgagc^ccgtgcagtg 
ctgaacggca^gcctggccga 
accaagacc^^catcgtgca 
aacaacaccc>!gcaagagcgt 
gtgatcggca^catccgcca 
ctgcagcaggj*tgatgaagaa 
ccccacgccg.^gcggcgacct 
ttctactgca.i-acaccagcaa 
aagtacaacg gcaacagcag 
cgcatgtggc ragggcgtggg 
tgccgcagca acatcaccgg 
aacaccgaga .ccttccgccc 
tacaagtaca aggtggtgga 
cgcgtggtgc agcgcgagaa 
ggcgccgccg gcagcaccat 
ctgctgagcg gcatcgtgca 
cacatgctgc agctgaccgt 
gagcgctacc tgaaggacca 
tgcaccaccg ccgtgccctg 
gacaacatga cctggatgca 
aacctgctgg aggacagcca 
gacaagtgga acaacctgtg 
ctcgag 



cacccagaag ^itctgccagc 
gatctgcaac ^aqcgaggacc 
cgccaagacc ^accctgttct 
cgtgtgggcc racccacgcct 
caacgtgacc gaEf aacttca 
cgtgatcagclietgtgggacc 
gaccctgaacs^tg^accgaca 
caacaacacc";aacggcaccg 
^tofdqgcctga 



cgccggcgcc; 
ggtgagcttc^^gcjqccatcc 
gtgcaacaae-aagaccttca 
cacccacggccafeaagcccg 
ggagggcatc^tcatccgca 
cctgaacgag '^gcgtggaga 
gcgcatcggc. |Ggcggccag^ 
ggcccactgc "a'acatcagca 
gctgggcgag^cScttcccca 
ggagatcacc j-atgcacagct 
cctgttcaac agcacctacc 
cagccccatc" accct gcagt 
ccaggccacc j-tacgcccccc 
catcctgctg acccgcgacg 
cggcggcggc gacatgcgcg 
gatcaagccc ;ctgggcatcg 
gcgcgccgtg ggcatcggcg 
gggcgccgcc agcatcaccc 
gcagcagagc aacctgctga 
gtggggcatc aagcagctgc 
gcagctgctg ggcatctggg 
gaacagcagc tggagcaaca 
gtgggaccgc gagatcagca 
gaaccagcag gagaagaacg 
gaactggttc gacatcagca 



agtggtggat 
tgtgggtgac 
gcgccagcga 
gcgtgcccac 
acatgtggaa 
agagcctgaa 
ccaacgtgac 
gcatctacaa 
tcaactgcaa 
ccatccacta 
acggcaccgg 
tggtgagcac 
gcgagaacct 
tcaactgcac 
ccttctacgc 
ccgaccgctg 
acaagaccat 
tcaactgccg 
acagcaacaa 
gcaagatcaa 
ccatcgccgg 
gcggcttcaa 
acaactggcg 
cccccaccaa 
ccgtgttcct 
tgaccgtgca 
aggccatcga 
aggcccgcgt 
gctgcagcgg 
agagcgagaa 
actacaccgg 
agaaggacct 
actggccctg 



ct ggggcat c 60 



cgfcgtactac 12 0 

cgccaaggcc 18 0 

cgarccccaac 240 

gaacgacat g 300 

gccfatgcgtg 360 

cggcaaccgc 420 

c4fcc.5aggag 480 

caeca gcacc 540 

c^gjcgccccc 600 

ccprrfcgctac 660 

ccjagctgctg 720 

gaccgagaac 780 

ccbrcccaac 840 

■ sea • ' 

caociaacgac 900 

gaacaagacc 960 
ccargttcaag 
eggegagtte 
cggcacctac 
geagategtg 
caacatcacc 



1020 
1080 
1140 
1200 
1260 
1320 



caccaccaac 
cagegagctg 1380 
ggccaagcgc 1440 
gggcttcctg 1500 
ggcccgccag 1560 
ggcccagcag 1620 
gctggccatc 
ccgcctgatc 
ggacatctgg 
cctgatctac 
gctggagctg 
gtacatctaa 



1680 
1740 
1800 
1860 
1920 
1980 
1986 
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performed in any order. In any of the embodiments described herein, the nucleic acid 
molecules can encode all, some or none of the polypeptides. Thus, one or more or the 
nucleic acid molecules (e.g., expression cassettes) described herein and/or one or more of 
the polypeptides described herein can be co-administered in any order and via any 
administration routes. Therefore, any combination of polynucleotides and/or 
polypeptides described herein can be used to generate elicit an immune reaction. 

Experimental 

Below are examples of specific embodiments for carrying out the present 
invention. The examples are offered for illustrative purposes only, and are not intended 
to limit the scope of the present invention in any way. 

Efforts have been made to ensure accuracy with respect to numbers used (e.g., 
amounts, temperatures, etc.), but some experimental error and deviation should, of 
course, be allowed for. 

Example 1 

Generation of Synthetic Expressio n Cassettes 
^ Mortification of HTV-1 Env. Ga * Pnl Nucleic Acid Coding Sequences 

The Pol coding sequences were selected from Type C strain AF1 10975. The Gag 
coding sequences were selected from the Type C strains AF1 1 0965 and AF1 10967. The 
Env coding sequences were selected from Type C strains AF1 10968 and AF1 10975. 
These sequences were manipulated to maximize expression of their gene products. 

First, the HIV-1 codon usage pattern was modified so that the resulting nucleic 
acid coding sequence was comparable to codon usage found in highly expressed human 
genes. The HTV codon usage reflects a high content of the nucleotides A or T of the 
codon-triplet. The effect of the HIV-1 codon usage is a high AT content in the DNA 
sequence that results in a decreased translation ability and instability of the mRNA. In 
comparison, highly expressed human codons prefer the nucleotides G or C. The coding 
sequences were modified to be comparable to codon usage found in highly expressed 
human genes. 

Second, there are inhibitory (or instability) elements (INS) located within the 
coding sequences of the Gag and Gag-protease coding sequences (Schneider R, et al., J 
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Virol. 71(7):4892-4903, 1997). RRE is a secondary RNA structure that interacts with the 
HTV encoded Rev-protein to overcome the expression down-regulating effects of the INS. 
To overcome the post-transcriptional activating mechanisms of RRE and Rev, the 
instability elements are inactivated by introducing multiple point mutations that do not 
alter the reading frame of the encoded proteins. Figures 5 and 6 (SEQ ED Nos: 3, 4, 20 
and 21) show the location of some remaining INS in synthetic sequences derived from 
strains AF1 10965 and AF1 10967. The changes made to these sequences aire boxed in the 
Figures. In Figures 5 and 6, the top line depicts a modified sequence of Gag polypeptides 
from the indicated strains. The nucleotides) appearing below the line in the boxed 
region(s) depicts changes made to further remove INS. Thus, when the changes indicated 
in the boxed regions are made, the resulting sequences correspond to the sequences 
depicted in Figures 1 and 2, respectively. 

The synthetic coding sequences are assembled by methods known in the art, for 
example by companies such as the Midland Certified Reagent Company (Midland, 
Texas). 

In one embodiment of the invention, sequences encoding Pol-polypeptides are 
included with the synthetic Gag or Env sequences in order to increase the number of 
epitopes for virus-like particles expressed by the synthetic, modified Gag/Env expression 
cassette. Because synthetic HTV-1 Pol expresses the functional enzymes reverse 
transcriptase (RT) and integrase (INT) (in addition to the structural proteins and 
protease), it may be helpful in some instances to inactivate RT and TNT functions. 
Several deletions or mutations in the RT and TNT coding regions can be made to achieve 
catalytic nonfunctional enzymes with respect to their RT and INT activity. {Jay. A. 
Levy (Editor) (1995) The Retroviridae, Plenum Press, New York. ISBN 0-306-45033X. 
Pages 215-20; Grimison,B. and Laurence, J. (1995), Journal Of Acquired Immune 
Deficiency Syndromes and Human Retrovirology 9(l):58-68; Wakefield, J. K.,et al., 
(1992) Journal Of Virology 66(11):6806-6812; Esnouf, R.,et al., (1995) Nature 
Structural Biology 2(4):303-308; Maignan, S., et al., (1998) Journal Of Molecular 
Biology 282(2):359-368; Katz, R. A. and Skalka, A. M. (1994) Annual Review Of 
Biochemistry 73 (1994); Jacobo-Molina, A., et al., (1993) Proceedings Of the National 
Academy Of Sciences Of the United States Of America 90(13):6320-6324; Hickman, A. 
B., et al., (1994) Journal Of Biological Chemistry 269(46):29279-29287; Goldgur, Y., et 
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Table D 



Name 


Seq Id 


Description 


gpl20mod.TVl.delV2 


119 


synthetic sequence of Env gpl20, including V2 
deletion and modified leader sequences derived 
from wild-type 8_2JTV1_C.ZA sequences 


gpl40mod.TVl.delV2 


120 


synthetic sequence of Env gpl40, including V2 
deletion and modified leader sequences derived 
from wild-type 8_2_TVl_C.ZA sequences 


gpl40mod.TVl .mut7.delV2 


121 


synthetic sequence of Env gpl40, including V2 
deletion and mutation in cleavage site and 
modified leader sequences derived from wild- 
type 8_2_TV1_CZA sequences 


gpl60mod.TVl .delVlV2 


122 


synthetic sequence of Env gpl60, including 
VI N2 deletion and modified leader derived 
from wild-type 8_2_TV1_C.ZA sequences 


anl fiOmnri TV1 delV2 


123 


synthetic sequence of Env gpl60, including V2 
deletion and modified leader sequences derived 
from wild-type 8_2_TV1_C.ZA sequences 


<ml fiflmnd TV1 mut7 delV2 

VUl UvlIlUVl* XVI .1111*1 1 .Uvi ▼ 


124 


synthetic sequence of Env gpl60, including V2 
deletion; a mutation in cleavage site; and 
modified leader sequences derived from wild- 
type 8_2_TV1_C.ZA sequences 


gpl60mod.TVl.tpal 


125 


synthetic sequence of Env gploO, TPA1 leader 


gpl60mod.TVl 


126 


synthetic sequence of Env gpl60, including 
modified leader sequences derived from wild- 
type (8_2_TV1_C.ZA) sequences 


gp 1 60mod.TV 1 .wtLnati ve 


127 


synthetic sequence of Env gpl60, including 
wild type 8_2_TV1_C.ZA (unmodified) leader 


gpl40.mod.TVl.tpal 


131 


synthetic sequence of Env gpl40, TPA1 leader 


gpHOmod.TVl 


132 


synthetic sequence of Env gpl40, including 
modified leader sequences derived from wild- 
type 8_2_TV1_C.ZA sequences 


gp 1 40mod.TV 1 . wtLnative 


133 


synthetic sequence of Env gpl20, including 
wild type 8_2_TV1_C.ZA (unmodified) leader 
sequence. 
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As noted above, Env-encoding constructs can be prepared using any of the full- 
length of gpl60 constructs. For example, a gp!40 form (SEQ ID NO:132) was made by 
truncating gpl60 (SEQ ID NO: 126) at nucleotide 2064; gpl20 was made by truncating 
gpl60 (SEQ ID NO:126) at nucleotide 1 551 (SEQ ID NO:126). Additional gpl40 and 
gpl20 forms can be made using the methods described herein. One or more stop codons 
are typically added (e.g., nucleotides 2608 to 2610 of SEQ ID NO:126). Further, the 
wild-type leader sequence can be modified and/or replaced with other leader sequences 
{e.g, TPA1 leader sequences). 

Thus, the polypeptide gpl60 includes the coding sequences for gpl20 and gp41. 
The polypeptide gp41 is comprised of several domains including an oligomerization 
domain (OD) and a transmembrane spanning domain (TM). In the native envelope, the 
oligomerization domain is required for the non-covalent association of three gp41 
polypeptides to form a trimeric structure: through non-covalent interactions with the 
gp41 trimer (and itself), the gpl20 polypeptides are also organized in a trimeric structure. 
A cleavage site (or cleavage sites) exists approximately between the polypeptide 
sequences for gpl20 and the polypeptide sequences corresponding to gp41. This cleavage 
site(s) can be mutated to prevent cleavage at the site. The resulting gpl40 polypeptide 
corresponds to a truncated form of gpl60 where the transmembrane spanning domain of 
gp41 has been deleted. This gp!40 polypeptide can exist in both monomelic and 
oligomeric (z.e. trimeric) forms by virtue of the presence of the oligomerization domain in 
the gp41 moiety. In the situation where the cleavage site has been mutated to prevent 
cleavage and the transmembrane portion of gp41 has been deleted the resulting 
polypeptide product is designated "mutated" gpl40 (e.g., gpl40.mut). As will be 
apparent to those in the field, the cleavage site can be mutated in a variety of ways. In the 
exemplary constructs described herein (e.g., SEQ ID NO:121 and SEQ ID NO:124), the 
mutation in the gpl20/gp41 cleavage site changes the wild-type amino acid sequence 
KRRWQREKR (SEQ ID NO: 129) to ISSWQSEKS (SEQ ID NO: 130). 

In yet other embodiments, hypervariable region(s) were deleted, N-glycosylation 
sites were removed and/or cleavage sites mutated. Exemplary constructs having variable 
region deletions (VI and/or V2), V2 deletes were constructed by deleting nucleotides 
from approximately 499 to approximately 593 (relative to SEQ ID NO:128) and V1/V2 
deletes were constructed by deleting nucleotides from approximately 375 to 
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\ : is the regions for p-sheet deletions 

*: is the N-linked glycosylate sites for subtype C TV1 and TV2. Possible mutation (N-» Q) or 
deletions can be performed. 



o 
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x _ . 50 

(!) MDAMKRGLCC^LfcCGAVFySPSATO 

(1) MRVMGTQKNCQQWW I WG I LGFWMLMI ^-^^^^^^^^^^^S^^ 1 ^^^^; 
(1) MRVMGTQKNCQQWWIWGIl|jFWM^ 

(i) tm^iiJ^r£BBmi^^ 

(1) ^VMGTQKNCQQWWIWGILGFW^ 

100 




I^^YETEVHNVWATHACVPTDPNPQE 



(51) FCASDi 

P2/V1V2/P3 

101 

(97) ^MH^DlJSX^&g 
(101) QMHpD^fe 
(101) {SHHEglgS 

(ioi) ^^*||]^^ A 




* * 150 

LHeTOTLKNATNTK -SSN- - - 

" C^TNVTGfeTVTGN§TNNTNb 
(DTNVTG^T\nX3N|GNpTN I A 
lTVNYN -NjTS--- 



(101) QMHEE 



151 

(139) WKEMDRGE^ K . 
(151) TGIYNIEE 
(151) NATYKYEI 

(151) A Y EEM^CS^raLRDKKHKEYALFYKT^IVPLm ENSNNFTY 




*200 

IDN DNTSJT 

'LN--ENSDNFT3C 
--ENSNNFTY 
IV^LNNRKNC^INN^ 
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(201) RLINCNTSTIT 



^ACPKVSFDPIPIHYCAPAGYAILKCNNKTFNGTGPCYN 



251 
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3 °° 

(235) V^OT^^fei^^^^^^^^^^^fc 11 — ~ 

(251) WTVQCraQi KPVVSTQLLLNGSLAEEGI I IRSENLTENTKT 1 1 VHLNE S 
301* 

(285) vgpflSgfc 

(299) VBiBNj 
(299) 

(30i) veinct^ STDRWNKTL 
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ClustalW Results 



Results of search ^ 


Number of sequences 




Alignment score 


1075087 


Sequence format 


Pearson 1£* ' "* 


Sequence type 
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ClustalW version 




JalView 




Output file 


liiistalw-20050726-2251 i ^.output 


Alignment file 


| jilStalw-20050726-2251 1 M.fa 


Guide tree file 


fiUstalw-20050726-225lim^d 
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Scores Table 



SeqA Name 


Len(nt) 


SeqB 


Name 


Len(nt) 


Scor> 














1 seqid46 


97 


2 


seqid47 


144 


100 


1 seqid46 


97 


3 


seqidll9 


1473 


97 


1 seqid46 


97 


4 


seqidl20 


1986 


97 


1 seqid46 


97 


5 


seqidl21 


1986 


97 


1 seqid46 


97 


6 


seqidl22 


2397 


97 


1 seqid46 


97 


7 


seqidl23 


2529 


97 


1 seqid46 


97 


8 


seqidl24 


2529 


97 
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1 
1 
1 
1 
1 

2 
2 
2 
2 
2 
2 

2 - 
2 

2 - 

2 - 
2' ; 

3 

3^' 

3 V 



5 
S 
5 
5 
5 
5 
5 
5 
5 
6 
6 
6 
6 
6 
6 
6 
6 
7 
7 
7 
7 



seqid46 
seqid46 
seqid46 
seqid46 
seqid46 
seqid46 
8eqid47 
seqid47 
seqid47 
seqid47 
seqid47 
seqid47 
seqid47 
seqid47 
seqid47 
seqid47 
seqid47 
. seqid47 
seqidll9 
seqidll9 
1 seqidll9 
h seqidll9 
seqidll9 
seqidll9 
seqidll9 
" seqidll9 
(seqidll9 
; seqidlJ.9 
;seqidll9 
■secjidl20 
seqidl20 
: 8eqidl20 
^seqidl20 
seqidl20 
seqidl2 0 
*seqidl2 0 
seqidl20 
seqidl20 
seqidl20 
seqidl21 
seqidl21 
seqidl21 
seqidl21 
seqidl21 
seqidl21 
seqidl21 
seqidl2l 
seqidl2i 
Beqidl22 
seqidl22 
seqidl22 
seqidl22 
seqidl22 
seqid!22 
seqidl22 
seqidl22 
seqidl23 
seqidl23 
seqidl23 
seqidl23 



seqidl25 

seqidl26 

seqidl27 

seqidl31 

seqidl32 

seqidl33 

seqidll9 

seqidl20 

seqidl2l 

seqidl22 

seqidl23 

seqidl24 

seqidl25 

seqidl26 

seqidl27 

seqidl31 

seqidl32 

seqidl33 

seqidl20 

seqidl21 

seqidl22 

seqidl23 

seqidl24 

seqidl25 

seqidl26 

seqidl27 

seqidl31 

seqidl32 

seqidl33 

seqidl21 

seqidl22 

seqidl23 

seqidl24 

seqidl25 

seqidl26 

seqidl27 

seqidl31 

seqidl32 

seqidl33 

seqid!22 

seqidl23 

seqidl24 

seqidl25 

seqidl26 

seqidl27 

seqidl31 

seqidl32 

seqidl33 

seqidl23 

seqidl24 

seqidl25 

seqidl26 

seqidl27 

seqidl31 

seqidl32 

seqidl33 

se<iidl24 

seqidl25 

seqidl26 

seqidl27 



i; is 
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7 


seqidl23 


2529 


12 


seqidl31 


2052 


91 


7 


seqidl23 


2529 


13 


seqidl32 


2073 


95 


7 


seqid!23 


2529 


14 


seqidl33 


2073 


94 


8 


seqid!24 


2529 


9 


seqidl25 


2613 


96 


8 


seqidl24 


2529 


10 


seqidl26 


2616 


99 


8 


seqidl24 


2529 


11 


seqidl27 


2616 


99 


8 


seqi(il24 


2529 


12 


seqidl31 


2052 


91 


8 


seqidl24 


2529 


13 


seqidl32 


2073 


95 


8 


seqidl24 


2529 


14 


seqidl33 


2073 


94 - 


9 


seqid!25 


2613 


10 


seqidl26 


2616 


96 r 


9 


seqid!25 


2613 


11 


seqidl27 


2616 


96t£- 


9 


seqidl25 


2613 


12 


seqidl31 


2052 


99 


9 


seqid!25 


2613 


13 


' seqidl32 


2073 


95 ~ 


9 


seqidl25 


2613 


14 


--Beqidl33 


2073 


95 : sa 


10 


seqidl26 


2616 


11 


seqidl27 


2616 


99 S': 


10 


seqidl26 


2616 


12 


-seqidl31 


2052 


96;o- 


10 


seqidl26 


2616 


13 


^seqidl32 


2073 


99 


10 


seqid!26 


2616 


14 


<4seqidl33 


2073 


993e~ 


11 


seqidl27 


2616 


12 


v:;Seqidl31 


2052 


. 96^^-- 


11 


seqid!27 


2616 


13 


m ^seqidl32 
; : ^seqidl33 


2073 




11 


seqidl27 


2616 


14; 


2073 




12 
12 


seqid!31 
segid!31 


2052 
2052 


13. 


;Sseqidl32 
i^silqidl33 


2073 
2073 




13 


seqldl32 


2073 


14 


J^seqidl33 


2073 


.3. 9^^: 



PLEASMNpTE: Some scores ma^^m^singfri 



om the above i 



[ I Segue nee Number IB! j 






ifihealig 



CLUSTAL W (1.82) multiple sequence alignment 



seqidl!9 

seqidl20 

seqidl21 

seqidl32 

seqid!33 

seqidl22 

seqidl23 

seqidl24 

seqid!26 

seqidl27 

seqid46 

seqid47 

seqidl25 

seqidl31 



GAATTCATG CGCQTGATGGGCAC C CAGAAG AACTGCCAGCAGTGGO 
GAATTCATGCGCGTGATGGGCACCCAGAAGAACTGCCAGCAGTGG1 
GAATTCATGCGCX3TGATGGGC AC CC AGAAG AACTGCCAGCAGTGGT 
GAATTCATGCGCGTGATGGGCACCCAGAAGAACTGCCAGCAGTGG'3 
GAATTCATGAGAGTGATGGGGACACAGAAGAATTGTCAACAATGGT 
GAATTCATGCGCGTGATGGGCACCCAGAAGAACTGCCAGCAGTGGT 
GAATTCATGCGCGTGATGGGCACCCAGAAGAACTGCCAGCAGTGG'3 
GAATTCATGCGCGTGATGGGCACCCAGAAGAACTGCCAGCAGTGG'] 
GAATTCATG03CGTGATGGGCACCCAGAAGAACTGCCAGCAGTGGT 
GAATTCATG AG AG TG ATGGGG AC ACAGAAG AATTGTCAACAATGGT 

GTCGACGCCAC CATGGATG CAATGAAG AGAGGGCTCTGCTGTC 

. ATGGATGCAATGAAGAGAGGGCTCTGCTGTC 
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seqidll9 

seqidl20 

seqid!21 

seqidl32 

seqidl33 

seqidl22 

seqidl23 

seqidl24 

seqidl26 

seqidl27 

seqid46 

seqid47 

seqidl25 

seqidl31 



CTGGGCTTCTGGATGCTGATGATCTGCAACACCGAGGACCTGTGGC 
CTGGGCTTCTGGATGCTGATGATCTGCAACACCGAGGACCTGTGG< 
CTGGGCTTCTGGATGCTGATGATCTGCAACACCGAGGACCTGTGGC 
CTGGGCTTCTGGATGCTGATGATCTGCAACACCGAGGACCTGTGG< 
TTAGGOTTCTGGATGCTAATGATTTGTAACACCGAGGACCTGTGGC 
CTGGGCTTCTGGATGCTGATGATCTGCAACACCGAGGACCTGTGGC 
CTGGGCTTCTGGATGCTGATGATCTGCAACACCGAGGACCTGTGGC 
CTGGGCTTCTGGATGCTGATGATCTGCAACACCGAGGACCTGTGGC 
CTGGGCTTCTGGATGCTGATGATCTGCAACACCGAGGACCTGTGGC 
TTAGGCTTCTGGATGCTAATGATTTGTAACACCGAGGACCTGTGGC 



-:>^ GGAGCAGTCTTCGTTTCGCCCAGCGCCAGCACCGAGGACCTGTGGC 
GGAG ^GTCTTCGTTTCGCCCAGCGCCAGCACCGAGGACCTGTGG<: 



seqidll9 
seqid!20 
seqidl21 
seqidl32 
seqidl33 
seqidl22 
seqidl23 
seqid!24 
seqidl26 
seqidl27 
seqid46 
seqid47v> > 
seqidl25 
seqidl31 



seqidll9 
seqidl20 
seqidl2l 
seqidl32 
seqidl33 
seqidl22 
seqidl23 
seqid!24 
seqidl26 
seqidl27 
seqid46 
seqid47 
seqidl25 
seqidl31 




f GGCGTGCCCGTGTGGCGCGACGCCAAGACCACCCTGTTCTGCGCCi 
^ GGCGTGCCCGTGTGGCGCGACTCCAAGACCTVCCCTGTTCTGCGCCi 
t GGCGTGCCCGTGTGGGGCGACTCCAAGACCACCCTGTTCTGCGC& 
_ GGCGTGCCCGTGTGGCGCGAC^CAAGACCACCCTGT^ 
TCCGTGCCCGTGTGGCGCGACOCCAAGACCACCCrGTTCTGOT 
GGKXntSCCCGTGTGGCGCC^ : 

g<^tgccx:gtgtggcgcgac^ 

\ ggcx3tgcccgtgtggcgcgacgco v 
^ggostgcccgtgtggcgcgacScc^ 
~ mcgtgcccgtgtggcgcgag^caagaccaccctgttctgcxics- 



^(^CGTGCCMTGTGGCXX:GAC& 
* <^CX3TGCCCGTGTGGCGCGAcS£^^ 




jtacg^accgaggtgcac^e^otgggccacccacgcctgc 
I^tacgagaccgaggtgcacaac^ 

I^TACGAGACCGAGGTGCACAACotGTGGGCCACCCACGCCTG 

p tacgagaccgaggtgc^caao|^ 

o TACGAGACCGAGGTGCTICAACX^^ 

TACGAGACCGAGGTGCACAAO^TGGGCCACCCAC^ * 

TACGAGACCGAGGTGCACAACGTOTGGGCCACCCACGCCTGCG 
i^^s * TACGAGACCGAGGTCCACAAOTTCTGGGCCACCCACGCCTGCGTCC 
TACGAGACCGAGGTGCACAACGTCTGGGCCACCCACGCCTGCGTGC 



TACGAGACCGAGGTGCACAACGTGTGGGCCACCCACGCCTGCGTGC 
TACGAGACCGAGGTGCACAACGTGTGGGCCACCCACGCCTGCGTGC 



seqidll9 

seqidl20 

seqidl21 

seqidl32 

seqidl33 

seqidl22 

seqidl23 

seqidl24 

seqidl26 

seqidl27 

seqid46 

seqid47 



CCCCAGGAGATCGTGCTGGGCAACGTGACCGAGAACTTCAACATG'] 
CCCCAGGAGATCGTGCTGGGCAACGTGACCGAGAACTTCAACATG'3 
CCCCAGGAGATCGTGCTGGGCAACGTGACCGAGAACTTCAACATGO 
CCCCAGGAGATCGTGCTGGGCAACGTGACCGAGAACTTCAACATGI 
CCCCAGGAGATCGTGCTGGGCAACGTGACCGAGAACTTCAACATGT 
CCCCAGGAGATCGTGCTGGGCAACGTGACCGAGAACTTCAACATGT 
CCCCAGGAGATCGTGCTGGGCAACGTGACCGAGAACTTCAACATGT 
CCCCAGGAGATCGTGCTGGGCAACGTGACCGAGAACTTCAACATGO 
CCCCAGGAGATCGTGCTGGGCAACGTGACCGAGAACTTCAACATG'] 
CCCCAGGAGATCGTGCTGGGCAACGTGACCGAGAACTTCAACATG*) 
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seqidl25 
seqidl31 



CCCCAGGAGATCGTGCTGGGCAACGTGACCGAGAACTTCAACATG1 
CCCCAGGAGATCGTGCTGGGCAACGTGACCXJAGAACTTCAACATGT 



seqidll9 
seqidl20 
seqidl21 
seqidl32 
seqidl33 
seqidl22 

- - seqidl23 

seqidl24 
T seqidl26 
seqidl27 
seqid46 

- seqid47 
*V. seqidl25 

^Aseqidl31 

/^t v seqidll9 
£:Seqidl20 
*seqidl21 
seqidl32 
,seqidl33 
rseqidl22 
t seqidl23 
| seqidl24 
| seqidl26 
ifeqidl27 
>;seqid46 
*-seqid47 
?Beqidl25 
|-seqidl3i 



1 seqidll9 
: seqidl20 
seqidl21 
seqidl32 
seqidl33 
seqidl22 
seqidl23 
seqidl24 
seqidl26 
seqidl27 
seqid46 
seqid47 
seqidl25 
seqidl31 



GCCGACCAGATGCACGAGGACGTGATCAGCCTGTGGGACCAGAGCC 
GCCGACCAGATGCACGAGGACGTGATCAGCCTGTGGGACCAGAGCC 
GCCGACCAGATGCACGAGGACGTGATCAGCCTGTGGGACCAGAGCC 
GCCGACCAGATGCACGAGGACGTGATCAGCCTGTGGGACCAGAGCC 
GCCGACCAGATGCACGAGGACGTGATCAGCCTGTGGGACCAGAGCC 
GCCGACCAGATGCACGAGGACGTGATCAGCCTGTGGGACCAGAGCC 
GCCGACCAGATGCACGAGGACGTGATCAGCCTGTGGGACCAGAGCC 
GCCGACCAGATGCACGAGGACGTGATCAGCCTGTGGGACCAGAGCC 
GCCGACCAGATGCACGAGGACGTGATCAGCCTGTGGGACCAGAGCC 
GCCGACCAGATGCACGAGGACGTGATCAGCCTGTGGGACCAGAGCC 



GCCGACCAGATGCACGAGGACGTGATCAGCCTGTGGGACCAGAGCC 
GCCGACCAGATGCACGAGGACGTGATCAGCCTGTGGGACCAGAGCC 

i^L" \^ 

AAGC^CCCCCCTGTGCGra AA<j^£C: 

AAGCT^CCCCCCTGTGCGTGACC^ ' -~<^r* " ' 
AAGCJ^CCCCCCTGTGC^^ 

AAGC|^CCCCCCTGTGCGTGACCCT<^ J3«j 

AAGCT^CCCCCCTGTGCGTGACCCTG^^ ~ " 

AAGefOACCCCCCTGTGCGTG- - - -A^jEj- 

AAG<i|§MCCCCCTGTGCGTGACCC^^ Agg 

AAGk2TGACCCCCCTGTGCGTGAC^ AAG 

"^CCCCCCTGTGCGTGACCCT^^ " ' ~ 
^CCCCCTGTGCGTGACCC^ 




AAGqjg^CCCCTGTGCGTGArc 
AAGCI^CCCCGCTGTGCGTGACCCTOAM 






ACCGT^CCGGCAACAGCACCAACAA»CCAACGGCACCGGCATC3 A«^£?. : 

ACCGlmCCGGCAACAGC^CCAACAA&CCAA :\CQGT6:\ Z ' 

ACCG-^CCGGCAACAGCACCAACy^ Rg^tfe.^: 

ACCGT^CCXK5CAACAGCACCAAC^»CCAAC^ ^Ow" O r 

ACCGT^CCGGCAACAGCACCAACAAC^C^C "XfCGTC." 

ACCGT^CCGGCAACAGCACCAACAACACCAACGGCACCGGCATC3 -CCC \". 

ACCGTGACCGG CAACAGC ACC AACAACACCAACGGCACCGGCATC3 -.COG ' . 
ACCGTGACCGGCAACAGCACCAACAACACCAACGGCACCGGCATCT 

ACCGTGACCGGCAACAGCACCAACAACACCAACGGCACCGGCATO '?S 



ACCGTGACCGGCAACAGCACCAACAACACCAACGGCACCGGCATCT 
ACCGTGACCGG CAACAGCACCAACAACACCAACGGCACCGGCATC3 



seqidl 1 9 ATGAAGAACTGCAGCTTCAACGCCGGCGCCGG 

seqidl20 ATGAAGAACTGCAGCTTCAACGCCGGCGCCGG 

seqidl21 ATGAAGAACTGCAGCTTCAACGCCGGCGCCGG 

seqidl32 ATGAAGAACTGCAGCTTCAACGCCACCACCGAGCTGCGCGACAAG; 

seqidl33 • atgaagaactgc^gcitcaacgccaccaccgagctgcgcgacaag; 

seqidl22 GGCGCCGG 

seqidl23 ATGAAGAACTGCAGCTTCAACGCCGGCGCCGG 

seqidl24 ATGAAGAACTGCAGCTTCAACGCCGGCGCCGG - 
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seqidl26 

seqidl27 

seqid46 

seqid47 

seqidl25 

seqidl31 



atgaagaactgcagcttcaacgccaccaccgagctgcgcgacaag; 
atgaagaactgcagcttcaacgccaccaccgagctgcgcgacaag; 



atgaagaactgcagcttcaacgccaccaccgagctgcgcgacaag; 
atgaagaactgcagcttcaacgccaccaccgagctgcgcgacaag; 



m" 



* l. Ji 



seqidll9 

seqidl20 

seqidl21 

seqid!32 

seqidl33 

seqidl22 

seqid!23 

seqidl24 

seqidl26 

seqi4l27 

seqjd46 

seqid47; 

seqi^l25 

seqid:i3i: 



see 



seqidlia 

seqidl20 

seqidl21 

seqidl32 

seqidl33 

seqidl22 

seqidl23 

seqidl24 

seqidl26 

seqidl27 

seqid46 

seqid47 

seqidl25 

seqidl31 



gccctgttctaccgcctggacatcgtgcccctgaacgagaacagcc 
gccctgttctaccgcctggacatcgtgcccctgaacgagaacagcc 



GCCCTGTTCTACCGCCTGGACATCGTGCCCCTC^tX3AGAACAGCC 
GCCCTGTTCTACCXSCCTGGACATCGTGCCCCTG^ 

GCCCTGTTCTAGMCCTGGACATCGTGCCCCTG^p3AGAACAGCC 
GCCCTGTTCriArcGCCrGGACATCGTGCCCC 





CGCCTX3ATO\W^CAACACCAGCACC 
CGCCTGATCtfi|||i£ 

CGCCTGATCA^^CAACACCAGC^CCATCACe^^dGCTGCCC^ 
CGCCTGATC&a|^^ 

CGCCTGATCAAOTGCAACACCAGCACCATCAGi 

^j^GCAACACCAGCACCATCAl 

CGCCTOATCA^^GCAACACC^GCACCATCACGl 
CGCCTGATCAAETOa^C^CCAGCACC^TCACi 
CGCCroATC^^pGCAACACCAGCACCATCAG 
CGCCTCATCAA6?tGCAACACCAGCACCATCACi 



ICTGCCCCi 
SdGCTGCCCC; 

igtgcccc; 

CTGCCCC2 
iTGCCCCi 
CTGCCCCJ 
CTGCCCCJ 




CGCCTC^TCAAC^ 

CGCCTGATC^(|raC^CACC^CACCATC^^^ 
CCC^VTCCCC^TCCACrACreCGCCCCC^ 

cccatccccatccactactgcgcccccgcc^gt^gccatcctgi 
cccatccccatccacracrrgcgcccccx^cgggtacgccatcctgj 
cccatccccatcc^ctactgcgcccccgccggcracgccatcctgi 
cccatccccatccactactgcgcccccgccggctacgccatcctgi 
cccatccccatccactactgcgcccccgccggctacgccatcctgi 
cccatccccatccactactgcgcccccgccggctacgccatcctg; 
cccatccccatccactactgcgcccccgccggctacgccatcctgi 
cccatccccatccactactgcgcccccgccggctacgccatcctg; 
cccatccccatccactactgcgcccccgccggctacgccatcctg; 



cccatccccatccactactgcgcccccgccggctacgccatcctgj 
cccatccccatccactactgcgcccccgccggctacgccatcctg; 



seqidll9 ACCTTCAACGGCACCGGCCCCTGCTACAACGTGAGCACCGTGCAG1 

seqid!20 ACCTTCAACGGCACCGGCCCCTGCTACAACGTGAGCACCGTGCAG') 

seqidl2 1 ACCTTCAACGGCACCGGCCCCTGCTACAACGTGAGCACCGTGCAG'3 

seqid!3 2 ACCTTCAACGGCACCGGCCCCTGCTACAACGTGAGCACCGTGCAG') 
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seqidl33 

seqidl22 

seqidl23 

seqidl24 

seqidl26 

seqidl27 

seqid46 

seqid47 

seqidl25 

seqidl31 



ACCTTCAACGGCACCGGCCCCTGCTACAACGTGAGCACCGTGCAGI 
ACCTTCAACGGCACCGGCCCCTGCTACAACGTGAGCACCGTGCAG1 
ACCTTCAACGGCACCGGCCCCTGCTACAACGTGAGCACCGTGCAGO 
ACCTTCAACGGCACCGGCCCCTGCTACAACGTGAGCACCGTGCAG1 
ACCTTCAACGGCACCGGCCCCTGCTACAACGTGAGCACCGTGCAG1 
ACCTTCAACGGCACCGGCCCCTGCTACAACGTGAGCACCGTGCAGT 



ACCTTCAACGGCACCGGCCCCTGCTACAACGTGAGCACCGTGCAG1 
ACCTTCAACGGCACCGGCCCCTGCTACAACGTGAGCACCGTGCAG1 



seqidll9 

seqidl20 

seqidl21 

seqidl32 

seqidl33 

seqidl22 

seqidl23 

seqidl24 

seqidl26 

seqid!27 

seqid46 

seqid47 

seqidl25 

seqidl31 



seqidllS 

seqidl20 

seqid!2i* 

seqidl32 

seqidl33 

seqidl22 

seqid!23 

seqidl24 

seqidl26 

seqidl27 

seqid46 

seqid47 

seqidl25 

seqidl31 



■ S ^GCCCGTGGTGAGCACCCAGCTGCTGCTGAACGGCAGCCTGGCCC 
- - ^^GCCCGTGGTGAGCACCCAGCTGCTGCTCAACGGCAGCCTGGCCC 
"* AAGCCCGTGGTGAGCACCCAGCTGCTGCTGAACGGCAGCCTGGCCC 
* ^AAGCCCGTGGTGAGCACCCAGCTGCTGCTCAACGGCAGCCTGGCCX 
r^-i^CCCGTGGTGAGCACCCAGCTGCTGCTOAACGGCAGC 

-GTGGTGAGCACCCAGCTGCTC^i^ 
^CCGTGGTGAGCACCC^UjCTGCTGj^^AACX3GCAGCCTGGCCC 
ITGGTGAGCACCCAGCTGCIxiSfcAACGGCAGCCTGGCCX 




ICCGTGGTGAGCACCCAGCl 
'CCGTGGTGAGCACCCAGCTGC1 



CAGCGAGAACCTGACCGAG 
CGCAGCGAGAACCTGACCGAG 
gGCAGCGAGAACCTGACCGAGAM 
lGCGAGAACCTGACCGAGAAC 
AGAACCTGACCGJ 



\CGGCAGCCTGGCCC 
3AACGGCAGCCTGGCCC 



|CAAGACCATCATCGTG< 
\CCATCATCGTGC 
|gAAGACCATCATCGTG< 
SCAAGACCATCATCGTGC 
ICAAGACCATCATCGTGC 



.GCGAGAACCTGACCGAGAAC^CCAAGACCATCATCGTGC 
|CC5C^GCGAGAACCTX^CCGAGAA^S£aM 

^jcck^agcxiAgaacctgaccgagaag^ 

gCGCAGOTAGAACCTGACCGAGAAaSbAAGACCATCATCGTGC 

'tccag^agaacctgaccgagaacaScaagaccatcatcgtgc 



r^^ATCXJGCAGCGAGAACCTGACCGAGAACTlCCAAGACCATCATCGTGC 
£TCCGCAGCGAGAACCTGACCGAOAA(^C^^ 



seqidll9 

seqidl20 

seqidl21 

seqidl32 

seqidl33 

seqidl22 

seqid!23 

seqidl24 

8eqidl26 

seqidl27 

seqid4 6 

seqid47 

seqidl25 

seqidl31 



r GTGGAGATCAACTGCACCCGCCCC^CAAC^CACCCXK^AGAGCC 
= ~ GTGGAGATCAACTGCACCCGCCCCAACAACAACACCCGCAAGAGCC 
GTGGAGATCAACTGCACCCGCCCCAACAACAACACCCGCAAGAGCC 
GTGGAGATCAACTGCACCCGCCCCAACAACAACACCCGCAAGAGCC 
- GTGGAGATCAACTGCACCCGCCCCAACAACAACACCCGCAAGAGCC 
GTGGAGATCAACTGCACCCGCCCCAACAACAACACCCGCAAGAGCC 
GTGGAGATCAACTGCACCCGCCCCAACAACAACACCCGCAAGAGCC 
GTGGAGATC^CTGCACCCGCCCCAACAACAACACCCGCAAGAGCC 
GTGGAGATCAACTGCACCCGCCCCAACAACAACACCCC3CAAGAGCC 
GTGGAGATCAACTGCACCCGCCCCAACAACAACACCCGCAAGAGCC 



GTGGAGATCAACTGCACCCGCCCCAACAACAACACCCGCAAGAGCC 
GTGGAGATCAACTGCACCCGCCCCAACAACAACACCCGCAAGAGCC 
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seqidll9 

seqidl20 

seqidl21 

seqidl32 

seqidl33 

seqid!22 

seqidl23 

seqidl24 

seqidl26 

seqidl27 

seqid46 

seqid47 

seqidl25 

seqidl31 



GGCCAGGCCTTCTACGCCACCAACGACGTGATCGGCAACATCCGCC 
GGCCAGGCCTTCTACGCCACCAACGACGTGATCGGCAACATCCGCC 
GGCCAGGCCTTCTACGCCACCAACGACGTGATCGGCAACATCCGCC 
GGCCAGGCCTTCTACGCCACCAACGACGTGATCGGCAACATCCGCC 
GGCCAGGCCTTCTACGCCACCAACGACGTGATCGGCAACATCCGCC 
GGCCAGGCCTTCTACGCCACCAACGACGTGATCGGCAACATCCGCC 
GGCCAGGCCTTCTACGCCACCAACGACGTGATCGGCAACATCCGCC 
GGCCAGGCCTTCTACGCCACCAACGACGTGATCGGCAACATCCGCC 
GGCCAGGCCTTCTACGCCACCAACGACGTGATCGGCAACATCCGCC 
GGCCAGGCCTTCTACGCCACCAACGACGTGATCGGCAACATCCGCC 



rGGCCAGGCCTTCTACGCCACCAACGACGTGATCGGCAACATCCGCC 
£GGCCAGGCCTTCTACGCCACCAACGACGTGAT 



seqidll9 

seqidl20 

seqidl2 1 

seqidl32 

seqidl33 

seqidl22 

seqidl23 

seqidl24 

seqidl26 

seqidl27- 

seqid46 

seqid47 

seqidl25 

seqidl31r 



seqidll9/- 

seqidl2~0" 

seqidl21 

seqidl32^ 

seqidl33 

seqidl22- 

seqidl23 

seqidl24 

seqidl26 

seqidl27 

seqid46 

seqid47 

seqid!25 

seqidl31 



r ;ATCILGCACCGACCGCTGGAACAAGACCCTGCA^^ 
^krrcAGCACCGACCGCTGGAACy^ACCXiTGCM 



jj^^GCACCGACCGCTGGAACAAGArccroC^ 

|j(5igcaccgaccgctggaacaagacc 
^agcaccgaccgctggaacaagaccctg^ 
lactcaccgctggaacaagaccct^^ 
Ecag^ccgaccgctggaac^ 




§3JG£GCACCGACCXK?TGGAACAAGACC 

GGTGATGAAGi 



GCGACCGCTGGAACAAGACCI 




\CAAGACCATCCAGTTCA^^G^^feO^GGCGACC 
^CAAGACCATCCAGTTCAA^GCiS 
IgCCCAAC^GACCATCC^TTCM^C^^ 
fcGCCC^CAAGACC^TCCAGTTCy^^ 
fOTCC^CAAGACCATCC^GTTC^^ 
UCCCAACAAGACCATCCAGTTO 
^TTCOZCAACAAGACCATC 
$t^CCAACAAGACC^TCCAGTTCAA(fctCCM 
£*^CTCCCCAACAAGACCATCCAGTTC^^ 
c VT^CCC^CAAGACCATCCAGTTC^^ 



■ TTCCCCAAC^GACCATCCAGTTCAAGCCCCAO^GXXJCGGCGACC 
:TTCCCCAACAAGACCATCCAGTTCAAGCCCCACGCCGGCGGCGACC 



seqidl!9 

seqidl2 0 

seqidl21 

seqidl32 

seqidl33 

seqidl22 

seqidl23 

seqidl24 

seqidl26 

seqidl27 

seqid46 

seqid47 



CACAGCTTGAACTGCCGCGGCGAGTTCITCTACItKIAACACCAGCJ 
CACAGCTTCAACTGCCGCGGCGAGTTCTTCTACTGCAACACCAGC; 
CACAGCTTCAACTGCCGCGGCGAGTTCTTCTACTGCAACACCAGC; 
CACAGCTTCAACTGCCGCC3GCGAGTTCTTCTACTGCAACACCAGCJ 
CACAGCTTCAACTGCCGCGGCGAGTTCTTCTACTGCAACACCAGCJ 
CACAGCTTCAACTGCCGCGGCGAGTTCTTCTACTGCAACACCAGCJ 
CACAGCTTCAACTGCCGCXX5CGAGTTCTTCTACTGCAACACCAGC7 
CACAGCTTCAACTGCCGCGGCGAGTTCTTCTACTGCAACACCAGa 
CACAGCTTCAACTGCCGCGGCGAGTTCTTCTACTGCAACACCAGC7 
CACAGCTTCAACTGCCGCGGCGAGTTCTrCTACTGCAACACCAGC/ 
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seqidl25 
seqidl31 



CAGAGCTTCAACTGCCGCGGCGAGTTCTTCTACTGCAACACCAGC; 
CACAGCTTGAACTGCCGCGGCGAGTTCTTCTACTGCAACACCAGCJ 



rsssr~ 



seqidll9 

seqidl20 

seqidl21 

seqidl32 

seqidl33 

seqidl22 

seqidl23 

seqidl24 

seqid!26 

seqidl27^{ 

seqid46 

seqid47 ~ 

seqidl25 

seqidl3|^- 



s eqid 1 1 

seqidl2&^a , 
seqidl2i^; 1 c 
seqidl3|^:; . 
seqidl3|pgi.. 
seqidl22^>L>- 
seqidl23^§:- . 
seqid!24\p^^ 
secjidl2||i|f§ I 
s eqidl 2.2|j|§££ 

seqid47|^r^7> - 
seqidl2£ 



seqidll^^-. 

seqidl2tt^i, : 

seqidl2fev*.-* 

seqidl32 T?"" 

seqidl3i,i^:, 

seqidl22.\";-, 

seqidl23 • - 

seqidl24 \» 

seqid!26 J 

seqid!27 - 

seqid46 

seqid47 

seqidl25 

seqid!31 



ACCTACCACAGCAACAACGGCACCTACAAGTACAACGGCAACAGCi 
ACCTACCACAGCAACAACGGCACCTACAAGTACAACGGCAACAGC; 
ACCTACCACAGCAACAACGGCACCTACAAGTACAACGGCAACAGC; 
ACCTACCACAGCAACAACGGCACCTACAAGTACAACGGCAACAGCJ 
ACCTACCACAGCAACAACGGCACCTACAAGTACAACGGCAACAGC; 
ACCTACCACAGCAACAACGGCACCTACAAGTACAACGGCAACAGC; 
ACCT ACCACAGCAACAACGGCACCTACAAGT ACAAC GG CAACAG 01 
ACCTACC ACAGCAACAACGGCACCTACAAGTACAACGG CAACAG G? 
ACCTACCACAGCAACAACGGCACC TACAAGTACAACGGCAACAG CJ 
ACCTACCACAGCAACAACGGCACCTACAAGTACAACGGCAACAGCi 



ACCTACCACAGCAACAACGGCACCTACAAGTACAACGGCAACAGC; 
ACCTACCACAGCAACAACGQCACCTACAAGTACAACGGCAACAGCi 



CTGCAGTGCAAGATCAAGCAGATCGTGCGCATGTGGCAGGGCGTG< 

CTGCAGTGCAAGATC^AGCAGAT 

CTGCAGTGCAAGATCAAGp^jATCGTGCGCA 

CTGCAGTGCAAGATCAAGO^TOSTTC 

CTGCAGTGCAAGATCAAi&^ 

CTGCAGTGCAAGATCA^^OATCGTGCGCATGTGGCAGGGCGTGC 

CTGCAGTGCAAGATCA^dAGATCGTC^ 

CTGCAGTGCAAGATCAAGCAGATXX3TGCGCATC 

CTGGAGTGCAAGATCAMC^^ 

CTXX^GTGCAAGATCWpC^^ 

.CTGK^GTGCAAGATCM^ 

CTGCAGTGCAAGATCAAGCW^TOT 

CTGCAGTGC^GATCA^^^^X^CGCATGTGGCAGGGCG^ 

CTGCAGTGCAAGATOU!^ 

************ a****'!^ 

GCCCCCCCCATCGCCGGqU^TCACCTGCC^^ 
GCCCCCCCCATCGCCGCk!AACATCACCTGCa^GCAA^^ 
GCCCCCCCCATCGCCGqa^CAra 
GCCCCCCCCATCGCCXXxiAACATCACCTC^ 

GCCCCCCCCATCGCCGGC^CATCACCTGCCGCAGCAACATCACCC 

GCCCCCCCCATCGCCGGCAACATCACCTGCCGCAGCAACATCACCC 

GCCCCCCCCATCGCCGGCAACATCACCTGCCGCAGCAACATCACCC 

GCCCCCCCCATCGCCGGCAACATCACCTGCCGCAGCAACATCACCC 

GCCCCCCCCATCGCCGGCAACATCACCTGCCGCAGCAACATCACCC 

GCCCCCCCCATCGCCGGCAACATCACCTGCCGCAGCAACATCACCC 

GCCCCCCCCATCGCCGGCAACATCACCTGC 

GCCCCCCCCATCGCCGGCAACATCACCTGCCGCAGCAACATCACa 

GCCCCCCCCATCGCCGGCAACATCACCTGCCGCAGCAACATCACCC 

GCCCCCCCCATCGCCGGCAACATCACCTGCCGCAGCAACATCACCC 
****************** ************ 



-CSgfAG 

egggjcAo . - 



e*3CAGT *: 

vgt-; 



feTGCAGI^ 

*3+4rs*di * * > - 

.occccc : ■ 

OCCCCC 

gccct":*: . 
Gdcc 



seqidl!9 
seqidl20 
seqidl21 
seqidI32 
seqid!33 
seqidl22 
seqid!23 
seqid!24 



CGCGACGGCGGCTTCAACACCACCAACAACACCGAGACCTTCCGCC 
CGCGACGGCGGCTTCAACACCACCAACAACACCGAGACCTTCCGCC 
CGCGACGGCGGCTTCAACACCACCAACAACACCGAGACCTTCCGCC 
CGCGACGGCGGCTTCAACACCACCAACAACACCGAGACCTTCCGCC 
CGCGACGGCGGCTTCAACACCACCAACAACACCGAGACCTTCCGCC 
CGCGACGGCGGCTTCAACACCACCAACAACACCGAGACCTTCCGCC 
CGCGACGGCGGCTTCAACACCACCAACAACACCGAGACCTTCCGCC 
CGCGACGGCGGCTTCAACACCACCAACAACACCGAGACCTTCCGCC 
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seqidl26 

seqidl27 

seqid46 

seqid47 

seqidl25 

seqid!31 



CGCGACGGCGGCTYTCAACACCACCAACAACACCGAGACCTTCCGC( 
CGCGACGGCGGCTTCAACACCACCAACAACACCGAGACCTTCCGCC 

CGCGACGGCGGC 

CGCGACGGCGGCTTCAACACCACCAACAACACCGAGACCTTCCGCC 
CGCGACGGCGGCTTCAACACCACCAACAACACCGAGACCTTCCGCC 



seqid!19 . 
seqidl20 
seqidl21 
seqidl32 
seqidl337C* 
seqidl22a$% 
seqid!22K- r 
seqidl24',j^- 
s eqi d 1 2 65j*a* . 
seqidl22i^x« 
seqid46^?' 
seqid47^v^: i 
seqidl25^ 
seqidl31^ 



seqidliiS 
seqidli 
seqidlZ 
seqidli 
seqidli 
seqidl^ 
seqidi23), 
seqidli|i 
seqidl^g 
seqidl2$ 
seqid46|^ 




ATGCGCGACAACTGGCGCAGCGAGCTGTACAAGTACAAGGTGGTGC 

ATGCGCGACAACTGGCGCAGCGAGCTGTACAAGTACAAGGTGGTGC 

ATGCGCGACAACTGGCGCAGCGAGCTGTACAAGTACAAGGTGGTGC 

ATGCGCGACAACTGGCGCAGOSAGCTGTACAAGTACAAGGTGGT^ 

ATGCGCGACAACTGGCGC^GCGAGCTGTACAAGTACAAGGTGGTGC 

ATGCGCGACAACTGGCGCAGCGAGCTGTACAAGTACAAGGTGGTG< 

ATGCGCGACAACTGGCGCAGCGAGCTGTACAAGTACAAGGT^ 

ATGCGCGACAACTGGCGCAGCGAGCTGTACAAGTACAAGGTGGT^ 

ATGCGCGAC^CIKXX:Gci^^ 

ATGCGCGAC^iACTGGCGGAGCGAGCTGTACAAGTACAAGGTGGTC 

i Z*ttZ'J - 

AlXXTGCGACAACTGGqSMjSCG^^ 
ATGCGCGACAACTGGCGCAQCGAGCTGT^ 




GGC^TCGCCCCC^CCAAGGfcCAAGCGC^ 
GGC^T(^CCCCACC$$^ 
GGCATCGCCCCCACCa|^ 
GGCATCGCCCCCACCaX^ 
GGCATCGCCCCCACCAAX^C^^ 
GGCATCGCCCCCAi 
GGCATCGCCCCCACCM| 
GGCATCGCCCCCACr^ 1 
GGCATCGCCCCCAC 
GGCATCGCCCCCACi 



*^~^ca^gccgcx3^ 

caagcgccgcgtggtgcagcgcgSsj 
^tcagcagcgtggtgcmagS^^ 

pUGCGCCGCGTGGTGCAiGCGCGS&:. 




seqidl23^f 
seqi 



seqidll^c 

seqidl2p;> 3 l" 

seqidl21>y^ 

seqidl32^-* 

seqid!33. 

seqidl22," 

seqidl23 - 

seqidl24 

seqidl26 " 

seqid!27 

seqid46 

seqid47 

seqidl25 

seqidl31 



:CAAGCGCCGCGTGGTGCAGCGCG^ 

. -^fflfe 



GGCATCGCCCCCACCAAGGCXS^ 
GGCATCGCCCCCACCAA<^CAAGCGCCGCG^ 



CATCGGCGCCGTCTTCCTGGGCTTCCTGGGCGCCGCCX^CAGCACC 

CATCGGCX3CCGTGTTCCTGGGCT 

CATCGGCGCCGTGTTCCTCK3GCTTCCTGGGC^ 

CATCGGCGCCGTGTTCCTGGGCTTCCTKX^CGCCX3CCGGCAGCAC^ 

CATCGGCGCCGTGTTCCTGGGCTTCCTGGGCGCCGCCGGCAGCACC 

CATCGGCGCCGTGTTCCTGGGCTTCCTGGGCGCCGCCGGCAGCACC 

CATCGGCGCCGTGTTCCTGGGCTTCCTGGGCGCCGCCGGCAGCACC 

CATCGGCGCCGTGTTCCTGGGCTTC CTGGGCGCCGCCGGCAGCACC 

CATCGGCGCCGTGTTCCTGGGCTTCCTGGGCGCCGCCGGCAGCACC 



CATCGGCGCCGTGTTCCTGGGCTTCCTGGGCGCCGCCGGCAGCACC 
C^TCGGCGCCGTGTTCCTGGGCTTCCTGGGCGCCGCCGGCAGCACC 



seqid!19 

seqidl2 0 CATCACCCTGACCGTGCAGGCCCGCCAGCTGCTGAGCGGCATCGTC 

seqidl2 1 CATCACCCTGACCGTGCAGGCCCGCCAGCTGCTGAGCGGCATCGTC 

seqid!32 CATCACCCTGACCGTGCAGGCCCGCCAGCTGCTGAGCGGCATCGTC 
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seqidl33 

seqidl22 

seqidl23 

seqidl24 

seqidl26 

seqidl27 

seqid46 

seqid47 

seqidl25 

seqidl31 



CATCACCCTGACCGTGCAGGCCCGCCAGCTGCTGAGCGGCATCGTC 
CATCACCCTGACCGTGCAGGCCCGCCAGCTGCTGAGCGGCATCGTC 
CATCACCCTGACCGTGCAGGCCCGCCAGCTGCTGAGCGGCATCGTC 
CATCACCCTGACCGTGCAGGCCCGCCAGCTGCTGAGCGGCATCGTC 
CATCACCCTGACCGTGCAGGCCCGCCAGCTGCTGAGCGGCATCGTC 
CATCACCCTGACCGTGCAGGCCCGCCAGCTGCTGAGCGGCATCGTC 



CATCACCCTGACCGTGCAGGCCCGCCAGCTGCTGAGCGGCATCGTC 
CATCACCCTGACCGTGCAGGCCCGCCAGCTGCTGAGCGGCATCGTC 



seqidll9 
seqidl20 
seqidl21 
seqid!32 
seqid!33 
seqidl22 
seqidl23 
seqidl24 
seq±dl26 
seqidl27 
seqid46 
seqid47 
seqidl25 
■seqidl31 



seqidll9/ 
ieqidX20 
seqidl21 
seqidl32 
seqid!33; 
ffleqidl22 t 
seqidl23 
^seqidl24 
seqidl26 
seqid!27 
seqid46 
seqid47 
seqid!25 
seqid!31 



** seqidll9 
* seqidl20 
seqidl21 
seqidl32 
seqidl33 
seqidl22 
seqidl23 
seqidl24 
seqid!26 
seqid!27 
seqid46 
seqid47 
seqidl25 
seqid!31 



CCTGCTCAAGGCCATCGAGGCCCAGCAGCACATGCTGCAGCTGACC 

CCTGCTCAAGGCCATCGAGGCCCAGCAGCACATGCTGCAGCTGACC 

CCTGCTOAAGGCCATCGAGGCCCAGCAGCACATGCTGCAGCTGACC 

CCTGC!TCAAGGCCATCGAGGCCCAGCA<^ 

CCTGGTOAAGGCC^TCGAGGCCCAGCA^^ 

CCTGCTGAAGGCCATCGAGGCCCAGCA 

CCTCC^AAGGCC^TCGAGGCCCAGCAGaVC^TGCTGCAGCTGACC 
CCTOgTOAAGGCCATCGAGGCCCAG 

CCTGCTGAAGGCC^TCGAGGCCCAGCAGbvC^TGCTGCAGCTGACC 



CCrTGgg^GAAOT^ 
^C'^^Q^^CC^TCGAGGCCCAGGM^CATGCTGCAGC'reACC ■ 




MSGCCCGCGTGCTGGCCft 
3CCCGCGTGCTGGC 
GCAO^GCAGGCCCGCGTGCTGGCaS 
: GC^^QCAGGCCCGCGTGCTGGCCW 




GCTACCTGAAGGAC 
ICGCTACCTGAAGGAC 
iCGCTACCTGAAGGAC 
GGCTACCTGAAGGAC 



JCAGGCCCGCGTGCTGGCCAiOTAGCGCTACCTGAAGGAC 
GCttSgTGOUXSCCC^^ 

GCMGTOCAGGCCCGCGTGCTGGCCATCGAGCGCTACCTG^ 
GCAGCTGCAGGCCCGCX3TGCTGGCCM 

GCAGGTGCAGGCCCGCGTGCTGGCCATCGAGCGCTACCTGAAGGAC 

_ _ _ - • • • • ; 



GCAGCTGCAGGCCCGCX3TGCTGGCCATCGAGCGCTACCT 
GCAGCTGCAGGCCCGCGTGCTGGCCATCGAGCGCTACCTGAAGGAC 



CATCTGGGGCTGCAGCGGCCGCCTGATCTGCACCACCGCCGTGCCC 
CATCTGGGGCTGCAGCGGCCGCCTGATCTGCACCACCGCCGTGCCC 
CATCTGGGGCTGCAGCGGCCGCCTGATCTGCACCACCGCCGTGCCC 
CATCTGGGGCTGCAGCGGCCGCCTGATCTGCACCACCGCCGTGCCC 
CATCTGGGGCTGCAGCGGCCGCCTGATCTGCACCACCGCCGTGCCC 
CATCTGGGGCTGCAGCGGCCGCCTGATCTGCACCACCGCCGTGCCC 
CATCTGGGGCTGCAGCGGCCGCCTGATCTGCACCACCGCCGTGCCC 
CATCTGGGGCTGCAGCGGCCGCCTGATCTGCACCACCGCCGTGCCC 
CATCTGGGGCTGCAGCGGCCGCCTGATCTGCACCACCGCCGTGCCC 



CATCTGGGGCTGCAGCGGCCGCCTGATCTGCACCACCGCCGTGCCC 
CATCTGGGGCTGCAGCGGCCGCCTGATCTGCACCACCGCCGTGCCC 
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seqidll9 

seqidl20 

seqidl21 

seqidl32 

seqidl33 

seqidl22 

seqidl23 

seqidl24 

seqidl26 

seqidl27 

seqid46 

seqid47 

seqidl25 

seqidl31 



GAGCAACAAGAGCGAGAAGGACATCTGGGACAACATGACCTGGATC 
GAGCAACAAGAGCGAGAAGGACATCTGGGACAACATGACCTGGATC 
GAGCAACAAGAGCGAGAAGGACATCTGGGACAACATGACCTGGATC 
GAGCAACAAGAGCGAGAAGGACATCTGGGACAACATGACCTGGATC 
GAGCAACAAGAGCGAGAAGGAGATCTGGGACAACATGACCIX3GATC 
GAGCAACAAGAGCGAGAAGGACATCTGGGACAACATGACCTGGATC 
GAGCAACAAGAGCGAGAAGGACATCTGGGACAACATGACCTGGATC 
GAGCAACAAGAGCGAGAAGGACATCTGGGACAACATGACCTGGATC 
GAGC^CAAGAGCGAGAAGGACATCTGGC^CAACATGACCrrGGAT< 



. GAG CAACAAGAG CG AG AAGG ACATCTGGGACAACATG ACCTGG ATC 
GAG CAACAAG AGCG AGAAGG ACATCTGGGACAACATGACCTGG ATC 



seqidll9 
;^seqidl20 
^fLeqidl2l- 
;seqidl32 
^ff€jqidl33/ 
.:?,g£qidl22' 
seqidl23 
-seqidl24 
^seqidl26 

^Hqidi27:: 

Sg£qid46/ 
SSqid47; 
-*agqidl25. 
t$idl3l. 




ggidll? - 
3<jidl2b;: 
•^qidl2iv 
!iSeqidl32. 
^S^qidl33 
i:seqidl22 . 
: Sgqidl23 
seqidl24 
seqidl26 
seqidl27. 
seqid46 
seqid47 
seqidl25 
seqidl3l 



seqidll9 

seqidl20 

seqidl21 

seqidl32 

seqidl33 

seqidl22 

8eqidl23 

seqidl24 

seqidl26 

eeqidl27 

seqid46 

seqid47 



GATCAGCI^CTACACCGGCCTGATCTACAA^nX3CTGGAGGACAGC 
GATCAGCMdTACACCGGCCTGATCTAO^ 

GATCAGC^CTACACCGGCCTGATCTACAMGTCCTGGAGGACAGC 
GATCAOC^CTACACCGGCCTGATCTACA^GTGCTGGAGGACAGC 
GATCAGC^CTACACCGGCCTGATCTACATCCT 
GATCAGC|sACTACAC^ 

GATC^GC^C^ACACCGGCCTGATCTAC^ 
GAT" 
GAT' 




CTACACCGGCCTGATCTACA^CTGCTGGAGGACAGC 
CTACACCGGCCTGATCTACAAG^^ 




.CTACACCGGCCTGATCTACAfiGGTGCTGGAGGACAGC 



CACCGGCCTGATCTAl 




[GTGCTGGAGGACAGC 



GAAGAACtSft^^ 
GAAGAAGGJIGAAGGACCTG 
GAAGAACOAGAAGGACCTC^ 
GAAGAACGAOAAGGACCTGCTGGAGC^^ 
GAAGAACjSAGAAGG^ 
GAAG^COA^^ 
GAAGtfUu£3itoAAGGACCTGCT^ 

GAAGAACGAGAAGGACCTGCTGGAGCTGGACAAGTC^ 
GAAGAACGAGAAGGACCTGCTGGAGCTGGAC^ 

' 

GAAGAACGAGAAGGACCTGCTGGAGCTGGACAAGTGGAACAACCTC 
GAAGAACGAGAAGG ACCIGCTGGAGC TGGACAAGTGGAACAAC CTC 

CATCAGCAACTGGCCCTGGTACATCTAACTCGAG 

CATCAGCAACTGGCCCTGGTACATCTAACTCGAG - - 

CATGAGCAACTGGCCCTGGTACATCTAACTCGAG 

CATCAGCAACTGGCCCTGGTACATCTAACTCGAG 

CATCAGCAACTGGCCCTGGTACATCAAGATCTTCATCATGATCGTC 
CATCAGCAACTGGCCCTGGTACATCAAGATCTTCATCATGATCGTC 
CATCAGCAACTGGCCCTGGTACATCAAGATCTTCATCATGATCGTC 
CATCAGCAACTGGCCCTGGTACATCAAGATCTTCATCATGATCGTC 
CATCAGCAACTGGCCCTGGTACATCAAGATCTTCATCATGATCGTC 
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seqidl25 
seqidl31 



seqidll9 

seqid!20 

seqidl21 

seqidl32 

seqidl33 

seqidl22 

seqidl23 

seqidl24 

seqidl26 

seqidl27 

seqid46 

seqid47 

seqidl25 

seqidl31 



CATCAGCAACTGGCCCTGGTACATCAAGATCTTCATCATGATCGTC 
CATCAGCAACTGGCCCTGGTACATCTAA 

CCTGCGCATCATCTTCGCCGTGCTGAGCATCGTGAACCGCGTGCG( - ' 
- CCTGCGCATCATCTTCGCCGTGCTGAGCATCGTGAACCGCGTGCGC 
CCTGCGCATCATCTTCGCCGTGCTGAGCATCGTGAACCGCGTGCGC 
. ^CCTGCGCATCATCTTCGCCGTGCTGAGCATCGTGAACCGCGTGCG< - 
i^CCTGCGCATC^TCTTCGCCGTGCT^ 

--- .** — — - 

"«r ■' -4*..- 
" V*** "^7 ~ — — „ . m.-~? 

>'4CTGCGCATCATCTTCGrc v " 



seqidll9 

seqidl20 

seqidl21 

seqidl32 

seqidl33 

seqidl22 

seqid!23 

seqidl24 

seqidl26 

seqidl27 

seqid46 

seqid47 . 

seqidl25 

seqidl31 



seqidll9 

seqidl20 

seqidl21 

seqidl32 

seqidl33 

seqidl22 

seqidl23 

seqidl24 

seqidl26 

seqidl27 

8eqid46 

seqid47 

seqidl25 

seqid!31 



KiCTOAGCITCC^GACCCTGACCd^MCCCCCGCGGCCTTC 
Rj^AGCTTCCAGACCCTGACC^ 
ECTGAGCTTCCAGACCCTGACC 
ginX^CTTCCAGACCCTGACCCG^GC 
CCI^GCTTCCAGACCCTGACC^ 




"G. 



SCg^GCITCCAGACCCTGACC^^ 






>V ^GGAGGAGGGCGGCGAGCAGGACCGCGACCGCAGCATCCGCCTGGTC, * 
- ^GGAGGAGGGCGGCGAGCAGGACCGCGACCGCAGCATCCGCCTGGTC 

'GGAGGAGGGCGGCGAGCAGGACCGCGACCX3CAGCATCCGCCTGGTC '>S- 
' : . GGAGGAGGGCGGCGAGCAGGACCGCGACCGCAGCATCCGCCTGGTC <* 



* GGAGGAGGGCGGCGAGCAGGACCGCGACCGCAGCATCCGCCTGGTC 0 



seqidll9 
seqidl20 
seqidl21 
seqidl32 
seqidl33 
seqidl22 
seqidl23 
seqidl24 



CCTGGCCTGGGACGACCTGCGCAACCTGTGCCTGTTCAGCTACCA( 
CCTGGCCTGGGACGACCTGCGCAACCTGTGCCTGTTCAGCTACCAC 
CCTGGCCTGGGACGACCTGCGCAACCTGTGCCTGTTCAGCTACCAC 
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seqid!26 

seqidl27 

seqid46 

seqid47 

seqidl25 

seqidl31 



CCTGGCCTGGGACGACCTGCGCAACCTGTGCCTGTTCAGCTACCAC 
CCTGGCCTGGGACGACCTGCGCAACCTGTGCCTGTTCAGCTACCAC 



CCTGGCCTGGGACGACCTGCGCAACCTGTGCCTGTTCAGCTACCAC 



seqid!19 
seqidl20 
seqidl21 
seqidl32 
seqidl33 
seqidl22 
seqidl23 
seqidl24 
seqidl26' 
seqidl27. 
seqicUS?;:-' 
seqid47^C 
seqidl25> 
aec " 




seqig 
seqidij 
seqic 
seqic 
seqid^Tg 
seqid46- 
seqid$p£$ 
seqid^ir; 
seqidi31v 

seqidii9 

seqidiio 

seqidl21 

seqidi32 

seqid!33. 

seqidl22 

seqidl23 

seqidl24 

seqidl26 

seqidl27 

seqid46 

seqid47 

seqidl25 

seqidl31 



CATCCTGATCXX^CGTGCGCGCCGTGGAGCTGCTGGOTCACAGCAGC 

CATCCTGATCGCCGTGCGCGCCGTGGAGCTGCTGGGCCACAGCAGC 

CATCCTGATCGCCGTGCGCGCCGTGGAGCTGCTGC^CACAGCAGC 

CATCCTGATCGCCX3TGCGCGCCGTGGAGCTGCT^ 

CATCCTGATCGCQC^CGCGCCGTGG^ 



CATCCTGATCGGGQTGCGCGCCGTGGAGC' 






TCAGCAGC 



gcgcggctgggas^cctgaagtaci 
gcgcggctgggaSa^^ 

GCGCGGCTGGGj8iSTCCTC 

gcgcggctggg/^^^ctgaagtacctgggcai 
gcgcggctgggaqfttcctgaagtacctgggcaggi 




CAGTAC 
CAGTAC 
[TCCAGTAC 

rTAC 

ITGCAGTAC 



GCGCGGCIGGGA^TCCTGAAGTACC^^ 

. j«^fc -^aatoj. 

#' 



>£ *J*l.> ^ _ 

.'i i n"? ^ ' 



• - 1 ^ 



GAAGAAGAGCGCCATCAGCCTGCTGGACACCATCGCCATCACCGTC 
GAAGAAGAGCGCCATCAGCCTGCTGGACACCATCGeCATCACCGTC 
GAAGAAGAGCGCCATCAGCCTGCTGGACACCATCGCCATCACCGTC 
GAAGAAGAGCGCCATGAGCCTGCTGGACACCATCGGCATCACCGTC 
GAAGAAGAGCGCCATCAGCCTGCTGGACACCATCGCCATCACCGTC 



GAAGAAGAGCGCCATCAGCCTGCTGGACACCATCGCCATCACCGTC 



seqidll9 
seqidl20 
seqidl21 
seqid!32 
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seqidl33 

seqidl 2 2 CCGCATCATCGAGCTGGTGCAGCGCATCTGCCGCGCCATCCTGAAC 

seqidl2 3 C CGC ATCATCG AGCTGGTGCAG CGCATCTGC CG CGCCATCCTG AAC 

seqidl24 CCGCATCATCGAGCTGGTGCAGCGCATCTGCCGCGCCATCCTGAAC 

seqidl 2 6 CCGCATCATCGAGCTGGTGCAGCGCATCTGCCGCGCCATCCTGAAC 

seqidl27 CCGCATCATCGAGCTGGTGCAGCGCATCTGCCGCGCCATCCTGAAC 

seqid46 ' * 

seqid47 

seqidl 2 5 ■ " CCGCATCATCGAGCTGGTGCAGCGCATCTGCCGCGCCATCCTGAAC 

seqidl31 

seqidll9^ , ?\ \3\ 

seqidl 2 O^ggfo : r*±.- - 

seqidl 2 1%£ .2vi 

seqidl32~ife, rS£- ^ig- 

seqidl33 ^ |? * 

seqidl22^S^ . COTCCAGGGCTTCGAGbcCGCCCTGCTGTAACTCGAG 4^97 

seqidl 2 3 Jg^V. CCGCCAGGGCTTCGAGGCCGCCCTGCTGTAACTCGAG -22§2 9 

seqidl24g^r? ; CCGCCAGGGCTTCGAGGCCGCCCTGCTGTAACTCGAG -7$$2 9 

seqidl 2 Gjjj/ffi) 9 CCGCCAGGGCTTCGAC^CGCCCTGCTGTAACTCGAG^lS 4 * 

seqidl27:3ra't CCGCCAGGGCTTCGAG^CGCCCreOT 

seqid46l^^^p^" ' 4 * T jSr'*~'' ' { iV 

8eqid47!^K: t ||fc r — -U/j 

seqzdl 2 52Kgp > CCGCCAGGGCTTCGAGGGCGCCCTGCTGTAACTCGAG 

seqidl3'igSpr *. '4$Bs%- laifflE^ - 

PLEASE^J^E: Showing colors on lq£^%alignments is slow. 




Guide Tfe 




Seqid46: T 0. 00705, 
seqid47: 0.00705) 
:0 .00360, 
( 

seqidl25 : -0 . 00719, 

seqidl31:0. 00817) 

:0. 02358) 

:0. 01417, 

( 

( 

( 

( 
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seqidl22 : 0.01713, 
seqidl23:-0. 01713) 
:0. 00978, 

seqidl24: -0.00734) 
:0. 01374, 

seqidl26 : -0 . 01249) 
:0. 00803, 

seqidl27:-0. 00255) 
:0. 00380) 
:0. 00475, 
seqidl33:0.00545> 
:0. 00253, 
seqidl32: 0.00078^ 
: 0.00102, 
( & 
seqidll9 : 0 . 00677W 
seqidl20 : -0 . 00066) * 

:0.ooo8a, 

seqidl21 : 0 . 0027$)*; 



Cladogrim 



Right-click on the above tree to see display options. 

Problems printing? Read how to print a Phvlogram or Cladogram. 
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seqidnol20 GAATTCATGCGCGTGATGGGCACCCAGAAGAACTGCCAGCAGTGGTGGA 60 

seqidnol21 GAATTCATGCGCGTGATGGGCACCCAGAAGAACTGCCAGCAGTG^ 60 

************************************************************ 

seqidnol20 CTGGGCTTCTGGATGCTGATGATCTGCAACACCGAGGACCTGTGOT 120 

seqidnol21 CTGGGCTTCTGGATGCTGATGATCTGCAACACCGAGGACCTGTGGGTGACCGTGTACTAC 120 

************************************************************ 



seqidnol20 GGCGTGCCCGTGTGGCGCGACGCCAAGACCACCCTGTT 180 

seqidnol21 GGCGTGCCCGTGTGGCGCGACGCCAAGACCACCCTGTTC^ 180 vCC 



» * * i'v" 



seqidnol20 
seqidnol21 



seqidnol20 
seqidnol21 



seqidnol20 
seqidnol21 



eeqidnol20 
yh* seqidnol21 



.TACGAGACCGAGGTGC^CAAOSTC^ 240 r ^CZ^ r 

TACGAGACCGAGGTGCACAACGTCTGGGCCACCCACGC^^ 240 " ACGTC 

*^*^*******************^ ************************************ •* -<T*2* *<* 

♦ 5S/"~ -31-; 

CCCCAGGAGATCGTGCTGGGCAACGTGAC CGAGAACTTCAACATGTGGAAGAACGACATG 300 -OG^X J 
rCCCCAGGAGATCGTGCTGGGCAJSW^ 300 ^Sb^C 

/*** * ************ ************** ********** ********* - *f wiii 4 * 



^CCGACCy^ 360 J32sB$Sa: 

:^CGAC«GATGCACGAGGAc£^ 360 
j^********************£|&^ > **Y" 




, ^3-^ **^CCCCCCTGTGCGT^ 4 2 o »GL_ , * 

■^AGfCTGACCCCCCTGTGTO 42 b "^^^^P^ 

^Eg*_*****************^SS*S *************** **S^RS^w»* ********* - * •> - v ;^f3R|?9^* 



seqidnol20 
seqidnol21 



seqidnol20 
seqidno!21 



seqidnol20 
seqidnol21 



seqidnol20 
seqidnol21 



seqidno!20 
seqidnol21 



j^CGTGACCGGC^C^CA^^^ 480— 3_. 
J^TGACaXTCAAC^CACC^^ 480 ^^^^M- 

** W* *********** * ** * * * * '* **: * * ★ ★*★★★★»*»★★**** **S%rifr** ********** • c ^M*F- lS " 



^G^CTGCAGCTTCAACGf^ 540 : ^^gCfC : 

^ATt^GAACTGe^GCTC 54 0 ^^M^rr:^ 



- J; 



*;ATCACCC^GGCCTGCCCC^GGTGAGCTTTOACCCCATCCCCAT^ 6 00 'AASSTGAr 
_ATXAO:CAGGCCTG^^^ 600 V^&rGA 
^*^*******************^ ? '> • 

(krCGGCTACGCCATCCTGAAGTdCAACAACAAGACCTTC^^ 660 -AdT 

GCCGGCTACGCCATCCTGAAGTGCAACAACAAGACCTTCAACGGCACCGGCCCCTGCTAC 660 
**************************** ******************************** 



5- 

■ ? 



AACGTGAGCACCGTGCAGTGC^CCCACGGCATCAAGCCCGTGGTGAGCLACCCAGCTO 720 

AACGTGAGCACCGTGCAGTGCACCCACGGCATCAAGCCCGTGGTGAGGACCCAGCTGCTG 720 " 
***************************************** ******************* 



seqidnol20 CTGAACCKKIAGCCTGGCCGAG 780 

seqidnol21 CTGAACGGCAGCCTGGCCGAGGAGGGCATCATCATCCGCAGCGAGAACCTC 780 

*******************************************************^ #AW 

seqidnol20 ACCAAGACCATCATCGTGCACCTGAACGAGAGCGTGGAGATCAACTGCACCCGCCCCAAC 84 0 

seqidnol21 ACCAAGACCATCATCGTGCACCTGAACGAGAGCGTGGAGATCAACTGCACCCGCCCCAAC 840 

************************************************** ## ** w ^ # ^ w ^ 



seqidnol20 AACAACACCCGCAAGAGCGTGCGCATCGGCCCCGGCCAGGCCTTCTACGCCACCAACGAC 900 
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seqidnol21 AACAACACCCGCAAGAGCGTGCGCATCGGCCCCGGCCAGGCCTTCTACGCCACCAACGAC 900 

***************************************************** ******* 

seqidnol20 GTGATCGGCAACATCCGCCAGGCCCACTGCAACATCAGCACCGACCGCTGGAACAAGACC 960 

seqidnol21 GTGATCGGCAACATCCGCGAGGCCCACTGCAACATCAGCACC^CCGCTGGAAG 960 

**************************** ******************************** 

seqidnol2 0 CTGCAGCAGGTGATGAAGAAGCTGGGCGAGCACTTCCCCAACT^GACGATCCAGTTCAAG 1020 

seqidnol21 CTGCAGCAGGTGATGAAGAAGCTGGGCGAGCACTTCCCCAACAAGACCATCCAGTTCAAG 1020 

seqidnol20 . . CCCCACGCCGGCGGCGACCTGGAGATCACCATGCACAGCTTCAACTGCCGCGGCGAGTTC 1080 
seqidnol2 1 CCCCACGCCGGCGGCGACCTGGAGATCACCATGCACAGCTTCAACTGCCGCGGCGAGTOC - 1080 



seqidnol20 
seqidnbl21 



seqidnol20 
seqidnol21 



seqid^oi20 
seqi<inol21 




seqic}j}pl20 
seqidQOl21 



seqidribl20 
seqidno!21 



seqidnol20 
seqidno!21 



TTCTACTGCAACACC^GCAACCTGTTCAACAGCACOTACC^ 1140 

TTCTACTGCAACACCAGCAACCTGTTCAACAGCACCTACCACAGCAACAACGGCACCT :il40 
******************************************************* *-*■#* *>' ■ 



AAGTACAACGGCAACAGCAGCAGCCCCATC^CCCrG 1200 
AAGTACAACGGCZACAGCAGCAGCCCCATCAC^ 0 0 

****************** ************** ******** ******* ******* **S^*t* v ** •* 



CGCATGTGGCAGGGCGTGGGCCAGGCCACCTACG(^CCCCCATCGCCX^ 

CGCATGTGGCAGOGCGTGGGCCA 260 
*********** *-jT* ******************* *i^ 




TGCCGCAGCAACAl^CC^ 0 
TGCCGC^CAAi^CACOSG^^ 2 0 

^^*"*******************^^ 

AACACCGAGACCffiTCCC^CC 8 0 

AAC^CCGAGAC^pt^^ 8 0 

* * * ******* ***^**** ********* * * * ***rf*jg^**** ************* ^f^^^*- ~ 

TACAAGTACAAGGTGGTGGAGATCAAGCCC 





.TCGCCCCCACaU^CCAAddS©? 14 4 0 



TACAAGTACAAGGTTCTGGAGAT 4 o 

***********************************^*^ c - 



cgcgtggtgcagtocgagaagcgcgccgtgggc^tc^ 0 0 

agcgtggtgcag^gcgagaagagcgccgtgggc^ 1500 
***********-*;******** **********************************~*^* : * 

GGCGCCGCCGGCAGCACCATGGGCGCCGCCAGCAT&CC^ 1560 

GGCGCCGCCGGCAGCACCATGGGCGCCGCCAGCATCACCCTGACCGTGC^ 1560 
****************** ****************************************** 



seqidnol20 CTGCTGAGCGGCATCGTGCAGCAGCAGAGCAACCTGCTGAAGGCCATCGAGGCCCAGCAG 1620 

seqidno!21 CTGCTGAGCGGCATCGTG CAGCAGC AGAGCAACCTGCTGAAGGC C ATCGAGGC CCAGCAG 1620 

************************************************************ 

seqidnol20 CACATGCTGCAGCTGACCGTGTGGGGCATCAAGCAGCTGCAGGCCCGCGTGCTGGCCATC 1680 

seqidnol21 CACATGCTGCAGCTGACCGTGTGGGGCATCAAGCAGCTGCAGGCCCGCGTGCTGGCCATC 1680 

******* ***************************************************** 

seqidnol20 GAGCGCTACCTGAAGGACCAGCAGCTGCTGGGCATCTGGGGCTGCAGCGGCCGCCTGATC 1740 

seqidnol21 GAGCGCTACCTGAAGGACCAGCAGCTGCTGGGCATCTGGGGCTC 1740 

************ ************************************************ 



seqidnol20 TGCACCACCGCCGTGCCCTGGAACAGCAGCTGGAGCAACAAGAGCGAGAAGGACATCTGG 1800 
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seqidnol21 TGCACCACOTCCGTGCCCTGGAACAGCAGCTGGAGCAACAAGAGCGAGAAGGACATCTGG 1800 

seqidnol20 GACAACATGACCTGGATGCAGTGGGACCGCGAGATC^GCAACTAC^ 1860 

seqidnol21 GACAACATGACCTGGATGCAGTGGGACCGCGAGATC^GGAACTACA I860 

seqidnol20 AACCTGCTGGAGGACAGCCAGAACCAGCAGGAGAAGAACGAGAAGGACCTGCTG^ 1920 . 

seqidnol21 AACCTGCTGGAGGACAGCCAGAACCAGCAGGAGAAGAACGAGAAGGACCTGCTGGAGCTG 1920 ^ V v~' * 

♦A********************************************************** , t , * * * 



seqidnol20 GACAAGTGGAACAACCTGTGGAACTGGTTCGACATCAGCAACT^ 1980 \'J . 

seqidnol21^ GACAAGTGGAACAACCTGTGG^CTGGTTCGA 1980 >09PCrrr 
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TABLE 1. Subtype-specific full-length reference sequences from 
the HIV Sequence Database 



US 



LTR 



9*9 



pol 



Atr\ 



vpr 
W/QrevF 

□ D 

□ DO 

tat vpu 

A- 



env 




LTR 

in 



revQ\ 1 

tat nef 



A — QP 

? — Ts 



120 
SU) 



FIG. 1. HIV-1 subgenomic regions utilized for epidemiological 
linkage analysis. A schematic representation of the HIV-1 genome is 
shown, with brackets denoting the subgenomic gag, gpl20, gp41, and 
LTR fragments that were amplified for diagnostic PCR analysis. Over- 
lapping gag and gpl20 regions are denoted by capital letters (gagA to 
-//; gpl20A to -C) and are referred to in Tables 2 to 4. 



the ability to confirm, with a high level of confidence, epide- 
miological linkage of HIV-1 transmission between members of 
all putative transmission pairs. 

Molecular analyses of suspected transmission links have 
been widely used to characterize localized HIV-1 outbreaks, 
mother-to-infant transmission, sexual transmission, sharing of 
contaminated needles, donation of contaminated blood, re- 
ceipt of contaminated clotting reagent, nosocomial transmis- 
sions from health care workers, and intrafamilial contacts (1, 4, 
5, 6, 13, 18, 21, 29, 36, 42, 44). In all of these cases, the 
establishment of epidemiological linkage relied on the docu- 
mentation of closer genetic relatedness between viruses infect- 
ing the suspected transmission pair(s) compared to control 
viruses isolated from unrelated individuals in the same region. 
Here we developed a similar approach to confirm (or refute) 
heterosexual transmission among discordant couples within 
the ZUHRP cohort. 

Blood samples were collected between 1996 and 2000 from 
both partners of 149 (of a total of 162) discordant couples in 
whom seroconversion had been documented. The time period 
between the last negative and the first positive blood tests for 
the seroconvertor (which in most cases was also the blood 
sample used for linkage analysis) was 4.9 months on average, 
but in some cases it extended up to 4 years. High-molecular- 
weight DNA was extracted from whole blood or Ficoll gradi- 
ent-purified peripheral blood mononuclear cells by using the 
QIAamp Blood Kit (Qiagen, Valencia, Calif.). For a small 
number of couples, DNA was extracted from dried blood spots 
(9). Because of the known variability of HIV-1, different re- 
gions of the HIV-1 genome were targeted for PCR amplifica- 
tion, resulting in comparisons of gag, gpl20, gp41, and/or long 
terminal repeat (LTR) regions as shown in Fig. 1. Although 
the gp41 primers were by far the most cross-reactive, the suit- 
ability of this primer set was discovered only after alternative 
genomic regions from a number of transmission pairs had 



Subtype_sequence name 


Accession no. 


Reference 


A_Q2317 


ArULWooj 


i% 
5i 


A_SE8891 


Aruoyo/j 


I 


A_SE8538 


Arooyooy 


n 

I 


A_SE6594 


ArUoyo/Z 


7 


A_SE7535 


ArUoyo/I 


1 


A_SE7253 


Ar 069670 


1 


ASE8131 


a m fYinn 1 
Ar 107771 


n 

1 


A_U455 


M6z3z0 


Zo 


A_92UG037.1 


U51190 


1 A 
14 


C_96BW04.07 


ArllUyoi 


ZO 


C 96BW11B01 


AFl 1 0971 


zo 


C_96BW15C02 


AFl 1 0974 


ZD 


C 96BW05.04 


AFl 10968 


ZD 


C_96BW16.26 


AFl 10978 


ZD 


C_96BW12.10 


AFl 1 0972 


ZD 


C_96BW17A09 


AFl 10979 


ZD 


C 96BW01B21 


AFl 10960 


ZD 


C_C2220 


U46016 


1C 
55 


C 94IN11246 


AF067159 


Z4 


C_98BR004 


AF286228 


55 


C_98IS002.5 


Arzo6z33 


11 


C_98TZ013.10 


AF286234 


11 
55 


C_98TZ017.2 


AfZo6Z3j 


j5 


C_97ZA012.1 


AF286227 


11 
55 


D 94UG114.1 


U 88824 


14 


D 84ZR085 


U88822 


14 


D NDK 


. M27323 


37 


D ELI 


K03454 


2 


D Z2Z6 


M22639 


38 


G DRCBL 


AF084936 


27 


G HH8793 


AF061641 


8 


G 92NG083.2 


U88826 


14 


G SE6165 


AF061642 


8 


J SE91733 


AF082395 


23 


J SE92809 


AF082394 


23 



already been analyzed (43). LTR, gpl20, and gp41 primers and 
amplification conditions have been described previously (15, 
16). The primers that were used to amplify sequences within 
gag were cgagA 5 ' -TG AT AAAACCTCC AATTCCCCCTA 
T-3' and PBS1A 5'-TTTGCCTGTACTGGGTCTCTCTGGT 
T-3' in the first round and cgagB 5 '-AATACTGTATCATCT 
G CTCCTGTATC-3 ' and PBS1B 5 ' -G CTTAAG CCTC AATA 
AAGCTTGCCTT-3 ' in the second round. PCR products were 
sequenced directly, using cycle sequencing and dye terminator 
methodologies, on an automated DNA sequencer (model 
377A; Applied Biosystems, Inc., Foster City, Calif.). Both 
strands of the PCR products were sequenced (sequences are 
available under GenBank accession numbers AF404868 
through AF405203, AF406742, and AF406743). Although pop- 
ulation-based sequencing was used to allow analysis of the 
predominant viral form in each individual, the number of am- 
biguous base pairs in the entire data set was <0.3%. 

To establish suitable linkage criteria for HIV-1 strains in- 
fecting the Zambian couples, amplified viral sequences were 
first subjected to preliminary phylogenetic tree analyses to 
identity all circulating HIV-1 group M subtypes (not shown). 
Full-length and nonrecombinant reference sequences repre- 
senting these subtypes were then obtained from the Los 
Alamos HIV Sequence Database (Table 1) and subjected to 
pairwise sequence comparisons in the genomic regions corre- 
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To better understand the virological aspect of the expanding AIDS epidemic in southern Africa, a set of 23 
near-full-length clones of human immunodeficiency vims type 1 (HIV-1) representing eight AIDS patients from 
Botswana were sequenced and analyzed phylogenetically. All study viruses from Botswana belonged to HIV-1 
subtype C. The interpatient diversify of the clones from Botswana was higher than among full-length isolates 
of subtype B or among a set of full-length HIV-1 genomes of subtype C from India (mean value of 9.1% versus 
6.5 and 43%, respectively; P < 0.0001 for both comparisons). Similar results were observed in all genes across 
the entire viral genome. We suggest that the high level of HIV-1 diversity might be a typical feature of the sub- 
type C epidemic in southern Africa. The reason or reasons for this diversity are unclear, but may include an 
altered replication efficiency of HIV-1 subtype C and/or the multiple introduction of different subtype C viruses. 



The majority of new human immunodeficiency virus (HIV) 
infections in the global AIDS epidemic are appearing in sub- 
Saharan Africa and Southeast Asia. Compared with the situa- 
tion a decade ago, the main AIDS epidemics have shifted from 
central and eastern Africa to the southern regions. The most 
severe HIV epidemics have recently afflicted such southern 
African countries as Zimbabwe, Zambia, Namibia, South Af- 
rica, and Botswana (43). HIV-1 subtype C has been estimated 
to account for 48% of HIV-1 infections worldwide and 51.5% 
of HIV-1 infections in Africa (4, 7, 14-16, 21, 31), where the 
main mode of transmission is heterosexual (43, 44, 47). 

A rapid expansion of the HIV-1 epidemic in Botswana has 
occurred since the early to mid 1990s. According to the 
UNAIDS and World Health Organization (WHO) Global 
HIV/AIDS & STD Surveillance data, HIV prevalence among 
antenatal clinic attendees tested in the major urban areas of 
Botswana (Gaborone, Francistown, and Selebi-Phikwe) in- 
creased from 6% in 1990 to 39% in 1997 (range of 34 to 43%) 
(42). Among women 20 to 29 years of age, 43 to 44% tested 
HIV positive. Outside of the major urban areas, median HIV 
prevalence increased from no evidence of infection in 1985 to 
1987 to 34% in 1997. In 1997, HIV prevalence in Botswana 
ranged from 28 to 38%. As such, locally circulating HIV-1 
needs to be characterized thoroughly, and vital information 
about the nature of the epidemic should be extended (2, 4, 7, 
21, 31, 37, 45-47). Moreover, Botswana's central .geographic 
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position makes a comprehensive HIV-1 molecular epidemio- 
logical study that much more urgent, because it may serve as an 
example of the burgeoning epidemic in southern Africa. 

In this study, we report the molecular cloning and phyloge- 
netic analysis of 23 near-full-length clones from Botswana. All 
of them were identified as belonging to HIV-1 subtype C and 
demonstrated high levels of intersample diversity across the 
entire viral genome. By providing new genetic information 
regarding locally circulating viruses, this study may contribute 
to AIDS vaccine design for the southern Africa region coun- 
tries and, in particular, for Botswana. 

Specimens for this study were selected from HlV-seroposi- 
tive patients in Gaborone, Botswana. All HIV-1 infections in 
this study were likely to be heterosexually acquired. The times 
of infection were not known. The HIV-1 -seropositive status of 
patients was confirmed by enzyme-linked immunosorbent as- 
say and Western blot analysis. Clinical classification was per- 
formed by using the 1987 Centers for Disease Control and 
Prevention (CDC) revised classification (9) (data not shown). 

Genomic DNA was obtained directly from the patients' pe- 
ripheral blood mononuclear cells (PBMCs)— buffy coats— 
without passage through cell culture or donor PBMCs. All 
clones in this study were amplified in heminested PCR with 
three primers from the LA set (18) or their modifications. The 
Expand Long Template PCR system (Boehringer Mannheim, 
Indianapolis, Ind.) was used according to the manufacturer's 
instructions. Gel purification of the first-round PCR product 
was essential for direct amplification of 9.0-kb fragments from 
uncultured PBMCs. Estimation of the expanded PCR sensitiv- 
ity (based on 8E5/LAV) revealed a successful amplification of 
the 9.0-kb fragment in the first round when at least 8 X 10 2 to 
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PIG i Phylogenetic relationship of the newly characterized full-length clones from Botswana (boxed in black) to other representative full-length HIV-1 sequences 
of subtypes A, B, C, D, F, and H and recombinant subtypes AE, AG, and BF. Full-length subtype C sequences from India were also included m the analysis. A 
neighborjoining tree was constructed on the basis of the hidden Markov model nucleotide alignment of full-length HIV-1 genomes. Subtype O ANT70 sequence was 
used as an outgroup. Values along the branches indicate the bootstrap values that support branching (out of a 1,000 resampling). 



4 X 10 3 proviral copies were present in the reaction (data not 
shown). These results were consistent with those from other 
studies (10, 39). The TA pCR2.1 TOPO system (Invitrogen, 
Carlsbad, Calif.) and JM109 competent cells (Promega Cor- 
poration, Madison, Wis.) were used for cloning. Positive col- 
onies were screened by PCR. To obtain sufficient plasmids 
for sequence analysis, we amplified the constructs under the 
previously described conditions with some modifications (41). 
Purified plasmid DNA served as a template for sequencing. 
Both-strand sequencing was combined with a strategy involving 
overlapping sequences. Dye terminator sequencing on an au- 
tomated DNA Sequenator (model 373A; Applied Biosystems, 
Inc., Foster City, Calif.) was used. 

A multiple alignment procedure for the full-length HIV ge- 
nome was performed by using the hidden Markov model. 
Constructed through the HIV-1 HMMER computer program 
of the Los Alamos National Laboratory, the model has been 
previously shown to provide the best description of the true 
nucleotide substitution pattern of HIV-1 gag and env genes 
(26). The HIV-1 HMMER model (11, 12) constructed at Los 
Alamos National Laboratory for the full-length HIV-1 ge- 



nomes (24) was employed. Sixty full-length reference se- 
quences were included in the alignment from the GenBank 
data bank (5). The 3' end of the alignment, which included the 
nef coding region and 3' long terminal repeat (LTR), was ad- 
justed manually. The pairwise evolutionary distances from nu- 
cleotide sequences were computed by the DNADIST program 
under Kimura's two-parameter model (17). All alignments 
were globally gap stripped for the generation of the trees. The 
transition/transversion ratio parameters were set at 3.0 for the 
gag gene, 1.5 for the env gene, 1.42 for the V1-V2 and V3 
fragments, and 2.0 for the other viral loci (25). A tree was 
drawn by the Njplot (33) and Tree View (32) programs. To 
analyze patterns of variability along the HIV-1 genomes, the 
program SWAN, which utilizes a "sliding window" approach 
was used (34), Positions with gaps either were or were not ex- 
cluded from the analysis. The variability distribution was es- 
timated as an entropy function of the nucleotide variation 
observed at a particular position. The Recombinant Identifi- 
cation Program (RIP) (40) and HIV-1 Subtyping Basic BLAST 
(3) were used in searching for recombination among the clones 
studied. 
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FIG. 2. Variability plots comparing subtype B and C (Botswana) sequences across the entire H1V-1 genome. The variability distribution was estimated as an entropy 
function of the nucleotide variation in the SWAN program based on the hidden Markov model alignment of the complete HIV-1 genome. The subtype B sequences 
used in the analysis were DH123, 89.6, RF, WEAU, OYI, HXB2, JRFL, and YU2. n, number of sliding window sites across the HIV-1 genome with gap stripping. 



Sequence analysis of the Botswana HIV-1 revealed that 10 
of 23 clones had an intact genomic organization with open 
reading frames. The other clones had point mutations and/or 
insertions and deletions resulting in frameshifts, disabled start 
codons, or premature stop codons. No major deletions or re- 
arrangements were observed. Determined length polymor- 
phism among studied sequences was limited to the vpu (15- to 
18-nucleotide [nt] insertion at the 5' end), env (from 3- to 9-nt 
deletions to 48-nt insertions, GIGRGQ motif in the BW17 V3 
loop), 2nd exon of rev (a 13-amino-acid truncation at the 3' 
end), nef(6- to 15-nt deletions in some clones), and regulatory 
regions of the LTR (three or four NF-kB sites with GGGAC 
TTTCT as a potential fourth NF-kB in two clones of BW05). 

An evolutionary tree in Fig. 1 shows the phylogenetic rela- 
tionship of the full-length Botswana clones to other represen- 
tative full-length HIV-1 sequences. All Botswana sequences, a 
set from India (27), and two subtype C reference sequences 
(C-ETH2220 [38] and C-92BR025 [19]) clustered together, 
forming a compelling subtype C outcropping on the phyloge- 
netic tree. This cluster was separated from other HIV-1 se- 
quences by the extremely high bootstrap value of 1,000 (out of 
1,000 resampling). Phylogenetic relationships within the sub- 
type C bush were also noteworthy. Assuming that the circled 
node at the center of the bush could represent the potential 
ancestral subtype C node, we observed the following, (i) The 
star-like phylogeny of the subtype C bush together with its 
branching order may demonstrate the relatively high diversity 
of the Botswana samples, (ii) Four Botswana sample clones 
(BW01, BW05, BW15, and BW16), together with all five se- 
quences from India, formed a potential subcluster, although 
the bootstrap value was not high, (iii) Ail Indian samples were 
separated by bootstrap values of 1,000, possibly reflecting a 
"founder effect" among these sequences, (iv) Three Botswana 
sample clones (BW04, BW11, and BW12) may represent indi- 
vidualized groups of sequences inherited from a common sub- 



type C ancestor, (v) Two reference sequences ETH2220 and 
92BR025 deviated together with a high bootstrap value (1,000), 
possibly reflecting another subtype C subcluster differing sig- 
nificantly from Botswana or Indian samples, (vi) One of the 
Botswana samples (BW17) strayed rather far off the main 
subtype C bush and may be the least representative of Bots- 
wana HIV-1 samples, (vii) The topology of the Botswana 
clones confirms that clones of the same samples are closely 
related to each other based on full-length genome sequences. 
A multilocus analysis was congruent with full-genome phylo- 
genetic analysis and confirmed clustering of newly derived Bots- 
wana clones within subtype C across the entire HIV-1 genome 
(data not shown). 

To characterize the level of variability among Botswana 
clones across the entire HIV-1 genome, we performed SWAN 
program analysis as an entropy function of the nucleotide 
variation. The Botswana set had greater variability than sub- 
type B samples (Fig. 2) (mean values of 11.6 and 8.5%, respec- 
tively, for gap-stripped analysis). The profiles of viral variability 
across the HIV-1 genome were similar among subtype B and C 
viruses. Comparison of gap-stripped and gap-nonstripped plots 
revealed that the differences in mean values between the two 
methods of computing and the shape of variability plot profiles 
were not significant (data not shown). Gap stripping slightly de- 
creased the mean value and the number of sliding window sites 
across the genome. It also hid the extreme regions with the 
highest level of variability. Both variability when measured as 
an entropy function in this study and when described before 
diversity as a pairwise comparison of the sequence (19) exhib- 
ited similar profiles of variable and conservative genomic re- 
gions. Variability plots (especially non-gap stripped) revealed 
higher peaks in variable regions than diversity plots. 

Table 1 depicts the high degree of intersample diversity 
across the entire HIV-1 genome among Botswana clones com- 
pared with subtype B and C sequences from India (27). Be- 
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TABLE 1. Intersample HIV-1 diversity in this study 11 
Mean % (range) HIV-1 diversity 6 
Gene or Subtype B Subtype C, 



region 


Subtype C, 
Botswana 


AIDS patients 
yo sequence:^ 


23 isolates 


India 


m 


7.9 (5.9-9.2) 


5.6* (3.6-9.6) 


4.9* (1.0-8.6) 


3.4* (2.4-^.6) 


pol 


5.9 (4.3-7.8) 


4.1* (2.9-5.4) 


3.9* (0.8-6.2) 


2.6* (1.5-3.7) 


vif 
VI] 


7 5 (4 3-12 S\ 


6 3t C3 7-9 3) 


6.4$ (0-9.5) 


3.2* (1.8-4.3) 


vpr 


Q R fS 1-14 


6 2* (4 3-8 41 


5.9* (1.7-10.8) 


4.8* (2.1-8.1) 


tat 




7 m /win 7^ 


7 1* (3 4-10 71 


3.7* (2.7-5.1) 


rev 


10.4 (7.7-16.6) 


9.4§ (4.2-14.2) 


8.2* (3.3-14.2) 


4.7* (2.8-5.7) 


vpu 


13.9(10.1-18.8) 


10.8* (6.1-14.3) 


9.5* (4.2-15.8) 


5.1* (2.1-8.1) 


env 
V1-V2 
V3 


12.4(10.4-14.5) 
25.7(15.9-36.7) 
14.4(11.5-18.9) 


9.5* (6.2-11.9) 
18.0* (8.6-26.1) 
11.6* (6.7-17.3) 


9.5* (5.9-12.7) 
19.0* (8.6-30.8) 
12.1* (6.0-18.2) 


6.9* (5.1-9.3) 
25.8 (14.2-34.8) 
6.4* (5.1-7.9) 


nef 


11.3 (8.0-15.1) 


9.8[! (6.3-15.7) 


9.2* (3.5-16.3) 


5.2* (4.4-6.4) 


y LTR 


9.7 (6.6-13.4) 


8.3# (5.1-11.6) 


7.3* (1.0-11.6) 


4.0* (3.2-5.4) 


Full-length 


9.1 (7.7-10.7) 


6.6* (4.5-8.0) 


6.5* (3.5-9.6) 


4.3* (3.2-5.7) 



genome 



8 Fifty-six full-length HIV-1 sequences were used in the analysis, based on the 
hidden Markov model alignment of the entire HIV-1 genome. The following 28 
sequences of subtype B were used: AUMBCC54, C18MBC, DH123, 89.6, RF, 
WEAU, HAN, HIVMN, BCSG3, OYI, CAM1, NY5, pNl>*3, LAI, HXB2, 
JRFL, JRCSF, AUMBC925, AUMBC200, YU2, YU10, ACH320A, ACH320B, 
SF2, HIV1AD8, D31, MANC, and WR27. Sequences AUMBCC54, C18MBQ 
and NY5 were excluded from the nef and 3' LTR analysis because of deletions 
or the absence of sequences for these regions. Pairwise distances in four groups 
of sequences (pNL43, LAV, and HXB2; JRFL and JRCSF; YTJ2 and YU10; and 
ACH320A and ACH320B) were excluded from the analysis. The eight subtype B 
sequences from AIDS patients were JRFL, YU2, 89.6, RF, HAN, MN, BCSG3, 
and WR27. Thirteen sequences of subtype C were included in the analysis: 8 
clones from Botswana (1 from each patient) and 5 sequences from India (301999, 
21068, 301905, 301904, and 11246). All distances were calculated by DNADIST 
program from the PHYLIP v. 3.572 package based on hidden Markov model 
alignment. The transition/transversion ratios were set to 3.0 for gag, 1.5 for env, 
1.42 for V3 and V1-V2, and 2.0 for all other HIV-1 genes. 

b Statistical significance versus Botswana HIV-1 clones is shown as follows: *, 
P < 0.0001; #,P = 0.005; ||, P = 0.013. 0.028; t,/> = 0.033; and §,P ~ 0.13. 



cause AIDS patients might be expected to have higher vari- 
ability, we made the same comparison, limiting the subtype B 
reference to eight sequences selected from confirmed AIDS 
patients (column 2). The intersample diversity among Bots- 
wana clones was significantly higher than that among subtype B 
references or Indian samples on the level of the full-length 
HIV-1 genome. Across the viral genome, the mean diversity 
among Botswana samples was congruent with the full-length 
genome analysis. The intersample diversity analysis statistically 
confirmed the phylogenetic study observations (Fig. 1 and 2) 
that the newly characterized Botswana clones were highly di- 
versified. 

The results of intrasample diversity analysis were limited by 
the methods used (PCR amplification and cloning) and by the 
available number of multiple subtype B full-length clones. Sev- 
en Botswana samples (except BW12) and three subtype B sets 
(JRFL, YU, and ACH320) were compared across the HIV-1 
genome. The range of full-length diversity among Botswana 
samples was 0.3% (BW17 clones) to 2.9% (BW04 clones), with 
an average mean value of 1.4%. Intrasample diversity showed 
no significant difference between subtype B and C sequences 
(Fig. 3). However, two concentrations of diversity (low and 
high) were revealed among both subtype B and C sets (Fig. 3). 
These concentrations of low and high diversity were distribu- 



ted across the entire genome and were found to be more con- 
sistent in the structural genes {gag, pol, and env). 

All Botswana sequences were checked for potential recom- 
bination sites by the HIV-1 Subtyping Basic BLAST (3) and by 
RIP (40), the results consistently showing no evidence of re- 
combination. 

Clustering with HIV-1 subtype C and the high intersample 
diversity were the most exceptional attributes of the 23-clone 
set from Botswana. A star-like shape of the subtype C cluster 
in the phylogenetic tree was accompanied by extremely high 
bootstrap values across tree branches. The topology of the 
phylogenetic tree suggested that a common ancestor for the 
Botswana sequences might have existed before the common 
ancestor for the Indian sequences analyzed or before the 
strains C-92BR025 (Brazil) and C-ETH2220 (Ethiopia) di- 
verged. 

Intersample diversity within subtype C has been previously 
found to vary from 5 to 11.5% (1, 7, 38). Higher levels of 
diversity were found among Botswana clones in this study, in 
spite of the fact that samples were taken from one place and at 
one time point. Both full-genome sequences and multiple sub- 
genomic loci demonstrated the same patterns, with a higher 
mean value of variability among the Botswana samples. 

The increased genetic diversity of subtype C viruses in Bots- 
wana might have different underlying causes, including a vari- 
ety of host and viral factors. Among the latter factors, a com- 
bination of the genetic flexibility of subtype C virus and its 
multiple introductions might be the most important. A number 
of recent findings argue that one possible cause of the high 
viral diversity in the Botswana epidemic could be higher flex- 
ibility of subtype C virus and its altered ability to diversify. 
These arguments include, but are not limited to the following, 
(i) Subtype C is predominant in most recent HIV-1 epidemics 
worldwide (1, 7, 27, 35, 36, 38, 45, 47). (ii) The highest prev- 
alence of HIV-1 infection in various epidemics is caused main- 
ly by subtype C virus, (iii) Subtype C virus may have a faster 
disease progression (20), and patients infected with HIV-1 sub- 
type C developed AIDS earlier than patients with subtype A 
virus (23). (iv) Three or four NF-kB sites (instead of two) might 
lead to more efficient viral transcription (13, 29, 30). (v) The 
TNF-a response to subtype C virus is significantly higher than 
to HIV-1 subtype B (28, 29), suggesting the possibility of in- 
creased viral transcription and replication in correlation with 
NF-kB copy number (28, 29). (vi) The viral load of subtype C 
infections may be higher in different compartments that might 
cause an increased level of viral transmission (22). On the oth- 
er hand, a scenario that suggests independent diversification of 
the virus in other regions and delayed entry of the epidemic in 
Botswana, followed by multiple introductions of the subtype C 
virus from adjacent countries cannot be excluded (42-44). 

Botswana is geographically located at the center of the AIDS 
epidemic in Southern Africa. UNAIDS and World Health Or- 
ganization surveillance data suggest that the widespread rise of 
the HIV-1 epidemic in Botswana started in the early to mid- 
1990s and reached one of the highest prevalence rates in Africa 
(42-44). For more recent HIV-1 epidemics, such as those de- 
scribed in Thailand and India, one might expect to find a highly 
homogeneous pool of local viruses that formed a monophyletic 
phylogenetic subcluster with relatively short and aggregated 
branches. However the findings in this study contradict the 
established trend. 

Extremely high interpatient diversity across the genome was 
supported by long branch lengths in the phylogenetic trees 
throughout the Botswana viruses within the genetic subtype 
C. No multiple subtypes or recombination were found in this 
study. Because it currently has the highest incidence rates of 
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FIG. 3. Intrasample diversity among subtype B (open bars) and newly derived subtype C (solid bars) clones from Botswana. Pairwise distances were computed by 
the DNADIST program from the PHYLIP package based on hidden Markov model alignment of the complete genome or individual genes or subgenomic regions (B) 
retrieved from the same alignment. Three sets of subtype B sequences were included: JRFL and FRCSF, YU2 and YU10, and ACH320A and ACH320B. Seven sets 
of subtype C Botswana clones were included: BW01, BW04, BW05, BW11, BW15, BW16, and BW17. The order of bars for each gene or region corresponds to the 
order in which the sets are mentioned. Dashed ovals depict low- and high-diversity groups. 



HIV-1, the region of southern Africa, including Botswana and 
Zimbabwe, is an important site for the design and develop- 
ment of an anti-HIV vaccine. Recent studies of cross-clade 
cytotoxic T-lymphocyte recognition (6, 8, 48) have made a 
significant contribution to AIDS vaccine development. How- 
ever, the extent of cross-clade immune responses in a highly 
diverse HIV-1 environment and the mechanism of vaccination 
throughout different HLA profiles in the population are still 
open questions. HIV-1 strains circulating locally (or appropri- 
ate combinations of local strains) should provide a more ef- 
fective prototype or candidate for a vaccine than a distinct 
virus circulating elsewhere. As such, a comprehensive molecular 
epidemiology study of locally circulating HIV-1 strains would 
facilitate the design of an effective AIDS vaccine for a partic- 
ular population, country, or. geographic region. 



Nucleotide sequence accession number. The 23 full-length 
HIV-1 sequences from Botswana are available under GenBank 
accession no. AF1 10959 to AF110981. 
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Most human immunodeficiency virus type 1 (HIV-1) transmissions in sub-Saharan Africa are believed to 
occur between married adults who are discordant for their HIV-1 infection status; however, no studies to date 
have investigated the molecular epidemiology of such transmission events. Here we report the genetic char- 
acterization of HIV-1 strains from 149 transmission pairs that were identified prospectively in a cohort of 
discordant couples in Lusaka, Zambia. Subgenomic gag, gpl20, gp41, and/or long terminal repeat regions were 
amplified by PCR analysis of uncultured blood samples from both partners and sequenced without interim 
cloning. Pairwise genetic distances were calculated for the regions analyzed and compared to those of subtype- 
specific reference sequences as well as local controls. Sequence relationships were also examined by phyloge- 
netic tree analysis. By these approaches, epidemiological linkage was established for the majority of trans- 
mission pairs. Viruses from 129 of the 149 couples (87%) were very closely related and clustered together in 
phylogenetic trees in a statistically highly significant manner. In contrast, viruses from 20 of the 149 couples 
(13%) were only distantly related in two independent genomic regions, thus ruling out transmission between 
the two partners. The great majority (95%) of transmitted viruses were of subtype C origin, although repre- 
sentatives of subtypes A, D, G, and J were also identified. There was no evidence for extensive transmission 
networks within the cohort, although two phylogenetic subclusters of viruses infecting two couples each were 
identified. Taken together, these data indicate that molecular epidemiological analyses of presumed transmis- 
sion pairs are both feasible and required to determine behavioral, virological, and immunological correlates of 
heterosexual transmission in sub-Saharan Africa with a high level of accuracy. 



By the end of the year 2000, an estimated 36 million adults 
and children were living with human immunodeficiency virus 
(HIV) infection-AIDS worldwide (39). More than 70% of 
these individuals resided in sub-Saharan Africa, where the 
average prevalence of HIV infection is currently 8.8% and 
transmissions occur predominantly through heterosexual 
routes or from mother to child (30). One of the African coun- 
tries with a particularly high prevalence of human immunode- 
ficiency virus type 1 (HTV-1) infection is Zambia, where it is 
estimated that 20% of all adults harbor HIV-1 and 20% of ail 
cohabitating couples are discordant for their HIV-1 infection 
status (i.e., one partner is HIV-1 positive and the other is 
negative) (40). Novel interventions designed to curtail the ex- 
plosive spread of HIV-1 in Zambia and other high-prevalence 
countries in sub-Saharan Africa are thus urgently needed but 
are likely to require detailed knowledge about the factors that 
influence heterosexual transmission. 

The Zambia-UAB HIV Research Project (ZUHRP) was 
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established in 1994 to provide voluntary HIV-1 testing and 
counseling, long-term monitoring, and health care to cohabi- 
tating couples in the capital city of Lusaka (3, 25). To date, 
9,569 couples have been tested for HIV-1, of whom 21% were 
HIV-1 discordant, 26% were concordant HIV-1 positive, and 
53% were concordant HIV-1 negative at the time of enroll- 
ment. Between February 1994 and October 2000, 1,022 discor- ' 
dant couples (535 with HIV-1 -infected men and 487 with HIV- 
1-infected women) were enrolled into a prospective study of 
the incidence and predictors of heterosexual transmission and 
were monitored at 3-month intervals for seroconversion of 
the seronegative partner. Although testing and counseling 
prompted substantial risk reduction in this cohort, a serocon- 
version rate of 8.5 per 100 person years remained, which was 
similar for male-to-female and female-to-male transmissions 
(12). Because frequent follow-up visits facilitated blood collec- 
tion from both the putative donor and the recipient after a 
transmission event, this cohort has provided a unique setting to 
examine the incidence, demographics, and behavioral and bi- 
ological correlates as well as the viral and host determinants of 
heterosexual transmission of HIV-1. However, a prerequisite 
for the acquisition of meaningful data, particularly with regard 
to predictors of contagion in the index seropositive partner, is 
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TABLE I. Subtype-specific full-length reference sequences from 
the HIV Sequence Database 
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FIG. 1. HIV-1 subgenomic regions utilized for epidemiological 
linkage analysis. A schematic representation of the HIV-1 genome is 
shown, with brackets denoting the subgenomic 'gag, gp!20, gp41, and 
LTR fragments that were amplified for diagnostic PCR analysis. Over- 
lapping gag and gpl20 regions are denoted by capital letters (gagA to 
-//; gp!20A to -C) and are referred to in Tables 2 to 4. 



the ability to confirm, with a high level of confidence, epide- 
miological linkage of HIV-1 transmission between members of 
all putative transmission pairs. 

Molecular analyses of suspected transmission links have 
been widely used to characterize localized HIV-1 outbreaks, 
mother-to-infant transmission, sexual transmission, sharing of 
contaminated needles, donation of contaminated blood, re- 
ceipt of contaminated clotting reagent, nosocomial transmis- 
sions from health care workers, and intrafamilial contacts (1, 4, 
5, 6, 13, 18, 21, 29, 36, 42, 44). In all of these cases, the 
establishment of ^ epidemiological linkage relied on the docu- 
mentation of closer genetic relatedness between viruses infect- 
ing the suspected transmission pair(s) compared to control 
viruses isolated from unrelated individuals in the same region. 
Here we developed a similar approach to confirm (or refute) 
heterosexual transmission among discordant couples within 
the ZUHRP cohort. 

Blood samples were collected between 1996 and 2000 from 
both partners of 149 (of a total of 162) discordant couples in 
whom seroconversion had been documented. The time period 
between the last negative and the first positive blood tests for 
the seroconvertor (which in most cases was also the blood 
sample used for linkage analysis) was 4.9 months on average, 
but in some cases it extended up to 4 years. High-molecular- 
weight DNA was extracted from whole blood or Ficoll gradi- 
ent-purified peripheral blood mononuclear cells by using the 
QIAamp Blood Kit (Qiagen, Valencia, Calif.). For a small 
number of couples, DNA was extracted from dried blood spots 
(9). Because of the known variability of HIV-1, different re- 
gions of the HIV-1 genome were targeted for PCR amplifica- 
tion, resulting in comparisons of gag, gp!20, gp41, and/or long 
terminal repeat (LTR) regions as shown in Fig. 1. Although 
the gp41 primers were by far the most cross-reactive, the suit- 
ability of this primer set was discovered only after alternative 
genomic regions from a number of transmission pairs had 
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already been analyzed (43). LTR, gpl20, and gp41 primers and 
amplification conditions have been described previously (15, 
16). The primers that were used to amplify sequences within 
gag were cgagA 5'-TGATAAAACCTCCAATTCCCCCTA 
T-3' and PBS1A 5 '-TTTGCCTGTACTGGGTCTCTCTGGT 
T-3' in the first round and cgagB 5 '-AATACTGTATCATCT 
G CTCCTGTATC-3 ' and PBS1B 5 ' -G CTTAAG CCTC AATA 
AAGCTTGCCTT-3 ' in the second round. PCR products were 
sequenced directly, using cycle sequencing and dye terminator 
methodologies, on an automated DNA sequencer (model 
377A; Applied Biosystems, Inc., Foster City, Calif.). Both 
strands of the PCR products were sequenced (sequences are 
available under GenBank accession numbers AF404868 
through AF405203, AF406742, and AF406743). Although pop- 
ulation-based sequencing was used to allow analysis of the 
predominant viral form in each individual, the number of am- 
biguous base pairs in the entire data set was <0.3%. 

To establish suitable linkage criteria for HIV-1 strains in- 
fecting the Zambian couples, amplified viral sequences were 
first subjected to preliminary phylogenetic tree analyses to 
identify all circulating HIV-1 group M subtypes (not shown). 
Full-length and nonrecombinant reference sequences repre- 
senting these subtypes were then obtained from the Los 
Alamos HIV Sequence Database (Table 1) and subjected to 
pairwise sequence comparisons in the genomic regions corre- 
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TABLE 2. Genetic diversity in different subgenomic regions for two sets of reference sequences 
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sponding to the PCR amplification products. Eight partially 
overlapping regions in gag, three in gpl20, one in gp41, and 
one in the LTR were used for analysis (Fig. 1). Uncorrected 
nucleotide sequence distances were then calculated for each 
transmission pair and compared to the mean sequence dis- 
tances calculated for the reference sequence set in the corre- 
sponding genomic region. The latter minus two standard de- 
viations (SDs) was arbitrarily assigned as the cutoff value for 
epidemiological^ linked sequence pairs (Table 2). Transmis- 
sion pairs were tentatively classified as epidemiological^ 
linked when their pairwise sequence distances fell below this 
limit. Conversely, transmission pairs were tentatively classified 
as unlinked when their pairwise distances exceeded this limit. 
For the subtype C reference set, only single representatives 
from India and Brazil were included, so as to not skew results 
due to the more recent introduction of HIV-1 into these coun- 
tries. Mean distances for the gp41 region of subtype J and the 
LTR region of subtype C could not be calculated because of a 
lack of sufficient reference sequences. 

Although the HIV-1 epidemic in Zambia is believed to be 
longstanding and mature (33), we examined the extent of ge- 
netic diversity of HIV-1 strains infecting all putative Zambian 
donors to exclude the possibility of a recent founder efFect 
within this cohort. As shown in Table 2, pairwise comparison of 
all Zambian donor sequences yielded mean distance values, 
SDs, and cutoff values that were very similar to those obtained 
for the reference sequences. This indicated that the selected 
reference sequences were indeed representative of the viruses 
infecting the cohort. There was no evidence for an unusually 
high degree of genetic relatedness among the Zambian donor 
viruses that could have confounded the linkage analysis. In- 
stead, the results suggested that the viruses circulating within 
the heterosexual transmission cohort were representative of 
the viruses circulating in the country at large. 

Having established suitable reference sequence sets, we next 
used the linkage criteria (Table 2) to tentatively classify the 149 
transmission pairs as either likely linked or unlinked. Table 3 



lists the identification number, dates of blood collection from 
donor and recipient (identical unless indicated otherwise), 
genomic region analyzed, and viral subtype for 129 transmis- 
sion pairs whose uncorrected pairwise distances fell below the 
cutoff value of the reference sequences (compare with Table 
2). Only one transmission pair (couple 136) yielded a pairwise 
distance (2.7%) that was slightly above the reference cutoff 
limit (2.6%). However, this pair was included as a likely linked 
transmission event after inspection of the two sequences re- 
vealed G-to-A hypermutation (41) as the cause of 9 of 10 
sequence changes between donor and recipient virus. G-to-A 
hypermutation was also identified as a reason for increased 
genetic diversity in four other pairs (couples 65, 132, 138, and 
149), although in these instances distance values did not exceed 
the cutoff limit. The majority of all couples listed in Table 3 
(127 of 129) also fell below the cutoff value of the Zambian 
donor sequences. These data thus indicated that most couples 
harbored viruses whose sequences were considerably more ho- 
mologous to one another than to unrelated reference se- 
quences from the database as well as local controls. 

Distance calculations also identified 20 couples harboring 
HIV-1 strains whose uncorrected pairwise distances exceeded 
the corresponding cutoff values, and this was confirmed by 
sequencing two independent genomic regions (Table 4). The 
great majority of pairwise distances from these transmission 
pairs fell well above the cutoff values of both sets of reference 
sequences (compare with Table 2), thus indicating a clearly 
discernible difference between linked and unlinked transmis- 
sion pairs (in the LTR region, Zambian donor sequences 
served as the sole reference set). This is best illustrated in Fig. 
2, where the pairwise distances of 15 subtype C reference 
sequences in the gp41 region are contrasted to the correspond- 
ing gp4i distances from 66 linked and 15 unlinked (subtype C) 
cohort transmission pairs. The median sequence distance of 
the viral group tentatively classified as linked was significantly 
different from the median distance of the viral group tenta- 
tively classified as unlinked as well as the median distance of 



TABLE 3. Genetic distances for linked Zambian transmission pairs 
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"The subgenomic region was analyzed as shown in Fig. 1. The subgenomic region size (in base pairs) is given in parentheses. F, 
b The cutoff value is 2 SD below the mean (see Table 2). 

r Pairwisc distances arc the percent sequence differences in the analyzed genomic region. 
d Determined as epidemiologically linked by phylogenetic tree analysis. 
' Genetic distance is primarily due to G-to-A hypermutation. 
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TABLE 4. Genetic distances for unlinked Zambian 
transmission pairs 



Sample 
ID 


Sample 
collection date 


Subgenomic 
region* 


Cutoff 
value* 


Pairwise 
distance 1 ' 
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9-23-98 


gpl20C 


8.3 


11.1 


c 


9 


8-16-00 


gp41 


5.4 


11.2 


c 




8-16-00 


g Q gP 


3.2 
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a Values are as defined in Table 3, footnote a. The subgenomic region was 
analyzed as shown in Fig. 1. F, female partner; M, male partner. 

h The cutoff value was 2 SD below the mean of the reference sequence set (see 
Table 2). . u , . 

c Pairwise distances are the percent sequence differences in the analyzed 
genomic region. 

rf The cutoff value for LTR sequences was derived from Zambian donor se- 
quences. 



the reference sequence group (P < 0.0001) by using a one- 
sided Mann-Whitney test (17). In contrast, the median se- 
quence distance of the unlinked viral group was not statistically 
different from that of the reference sequences (P > 0.05, 
Mann-Whitney test). 

In a final set of experiments, epidemiological linkage was 
assessed by phylogenetic tree analysis. PCR-derived viral se- 
quences from both partners were added to an existing master 
alignment (obtained from the Los Alamos HIV/SIV Sequence 
Database [http://hiv-web.lanl.gov/HTlvtUalignments.html]) that 
contained all reference sequences listed in Table 1. Sequences 
were aligned by using CLUSTAL W (19) and adjusted manu- 
ally by using MASE (10). Sites with a gap in any of the se- 
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FIG. 2. Within-group diversity among linked and unlinked Zam- 
bian transmission pairs and corresponding reference sequences. Sub- 
type C reference sequences (n = 15) from the Los Alamos HIV/SIV 
Sequence Database (Table 1) were subjected to pairwise sequence 
comparisons in the region corresponding to the PCR-amplified gp41 
fragment shown in Fig. 1. Pairwise sequence distances were also cal- 
culated for 66 subtype C transmission pairs classified as linked and 15 
subtype C transmission pairs classified as unlinked in the same 
genomic region. The distribution of distance values for these three 
different groups is depicted as boxes, with the lower and upper limits 
of the box delineating the 25th and 75th percentiles and the bars 
indicating the 10th and 90th percentiles, respectively. The median 
distance of the linked viral group (median = 1.5) was significantly 
different from that of both the unlinked viral group (median dis- 
tance = 8.8) and the reference sequence group (median distance = 
8.2) (P < 0.0001, one-sided Mann-Whitney test [17]). In contrast, the 
median sequence distance of the unlinked viral group was not statis- 
tically different from that of the reference sequence group (P > 0.05, 
Mann-Whitney test). 



quences or sites that were ambiguous due to the population 
sequence approach were excluded from further analyses. Evo- 
lutionary distances were corrected for superimposed hits by 
using Kimura's two-parameter method (22). Phylogenetic trees 
were constructed by using the neighbor-joining method (34), 
and the reliability of topologies was estimated by using the 
bootstrap approach (11). Bootstrap values of >80% were con- 
sidered significant (4, 20, 29, 36, 44). An example of a phylo- 
genetic tree constructed from gp41 sequences of 42 transmis- 
sion pairs and 26 reference sequences is shown in Fig. 3. All 
transmission pairs initially classified as linked by pairwise dis- 
tance analysis (depicted in red) also clustered together in phy- 
logenetic trees with significant bootstrap values (indicated by 
asterisks). Similarly, all transmission pairs initially classified as 
unlinked (depicted in blue) were not significantly related in 
phylogenetic trees. The latter was true for the two independent 
genomic regions analyzed (not shown). Finally, viral sequences 
derived from 98M and 98F (Fig. 3), which clustered with sub- 
type J viruses, were significantly related to each other and thus 
classified as epidemiologically linked. 

Phylogenetic tree analysis also yielded a subtype designation 
for each of the viruses infecting the 149 transmission pairs 
(data not shown). As shown in Fig. 3, the overwhelming ma- 
jority (141 of 149; 95%) of enrolled couples were infected with 
subtype C viruses. Three couples harbored subtype C viruses, 
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three couples harbored subtype A viruses, one couple har- 
bored subtype D viruses, and one couple harbored subtype J 
viruses, all representing linked transmissions. To determine 
whether non-subtype C viruses were introduced more recently, 
patient records were examined for the first occurrence of non- 
subtype C viruses (not shown). The results revealed no partic- 
ular association between the date of enrollment and the ap- 
pearance of non-subtype C strains within the ZUHRP cohort: 
couples infected with subtype A viruses were enrolled in 1996 
and 1999; couples infected with subtype G were enrolled in 
1995, 1996, and 1998; couples infected with subtype D were 
enrolled in 1998; and couples infected with subtype J were 
enrolled in 1997. If we assume no recombination in the re- 
mainder of the genome, these results indicate that subtype C 
predominates within the ZUHRP cohort. 

Finally, phylogenetic analysis allowed us to examine the evo- 
lutionary history of the cohort viruses compared to other 
HIV-1 strains from the same subtype. In particular, we were 
interested in determining whether ZUHRP couples were par- 
ticipating in transmission networks involving closely related 
viruses. Inspection of the phylogenetic tree in Fig. 3 revealed 
only two significant subclusters (indicated by brackets), each 
involving viruses from two sets of couples, which are shown in 
greater detail in Fig. 4. One subcluster involved subtype C 
viruses infecting couples 19 and 61, while the other involved 
subtype G viruses infecting couples 91 and 148. Given the short 
genomic region analyzed and the nonsignificant or borderline 
significant bootstrap values for three of the four couples (cou- 
ples 19, 61, and 91), we could not determine with confidence 
that transmission had occurred between the partners of the 
same rather than different couples. The exact sequence of 
transmission events involving couples 19 and 61 and couples 91 
and 148, respectively, thus remains to be determined. Rapid 
viral passage from a donor through one or more unidentified 
intermediaries to his or her putative recipient remains a the- 
oretical possibility for all transmission pairs classified as epi- 
demiologically linked in this study. However, since no other 
viral subclusters were identified in the data set, the existence of 
extensive transmission networks within the ZUHRP cohort is 
highly unlikely. 

In summary, this report describes the first comprehensive 
molecular epidemiological analysis of heterosexual transmis- 
sion events occurring among discordant couples in an African 
urban setting. Our analysis allowed us to (i) determine the 
proportion of linked and unlinked infections with a high level 
of certainty, (ii) identify the sequence subtype for all transmit- 
ted viruses in the genomic regions analyzed, and (iii) examine 
the cohort for evidence of transmission networks. The results 
show that of 149 cohabitating couples assumed to have trans- 



Subtype C 



Subtype G 




-98IS002.5 
-96BW01B21 



HH8793 
• 91M 



J »i 

k 



• 148F 



148M 



-DRCBL 



- 92NG083.2 



-SE6165 



0.05 



FIG. 4. Transmission networks within the ZUHRP cohort. Phylo- 
genetic trees were constructed from gp41 sequences of viruses infect- 
ing four different transmission pairs putatively classified as linked by 
pairwise sequence analysis. Couple identifiers are indicated in red (F, 
female partner; M, male partner). Horizontal branch lengths are 
drawn to scale (the scale bar represents 0.05 nucleotide substitutions 
per site); vertical separation is for clarity only. Values at nodes indicate 
the percentage of bootstraps in which the cluster to the right was 
found; only values of s80% are shown. Representative subtype C 
(left) and subtype G (right) reference sequences are included in each 
tree. Donor partners are underlined. 



mitted to each other, 129 (87%) were molecularly confirmed as 
epidemiological^ linked. Nevertheless, approximately 1 in ev- 
ery 10 transmission events involved an individual outside of the 
partnership. Assumptions concerning transmission linkage 
based on patient self-reporting alone are thus unlikely to be 
accurate, and this needs to be factored into the interpretation 
of transmission data from cohorts in which linkage has not 
been independently verified. For example, we found a stronger 
association between plasma viral load and transmission for 
female- to-male than for male-to-female transmissions in the 
ZUHRP cohort (12), while such a gender-based difference was 
not observed in a discordant couple cohort studied in Rakai, 
Uganda (32). Because transmission linkage was not confirmed 
at the molecular level, it is possible that some of the putative 
transmitters in this Ugandan cohort were misclassified. The 
proportion of unlinked transmissions is likely to vary consid- 
erably depending on the demographic, ethnic, and behavioral 
circumstances characterizing a cohort (45) but will undoubt- 
edly be >0%. Thus, for investigations that require accurate 



FIG. 3. Molecular linkage analysis for a subset of putative HIV-t transmission pairs. A phylogenetic tree was constructed from partial gp41 
sequences (consensus length, 276 bp) by using the neighbor-joining method (34) and the Kimura two-parameter model (22). Horizontal branch 
lengths are drawn to scale (the scale bar represents 0.05 nucleotide substitutions per site); vertical separation is for clarity only. Asterisks indicate 
bootstrap values in which the cluster to the right is supported in >80% replicates (out of 1,000). Newly derived sequences from 42 transmission 
pairs (84 individuals) are shown, along with 26 reference sequences from the Los Alamos Sequence Database (http://hiv-web.lanl.gov/HTML/ 
alignments.html). Viruses from 36 couples are closely related to one another and cluster together with significant bootstrap values, indicating that 
they are epidemiologically linked (highlighted in red and denoted by dots). Viruses from six couples do not cluster together and exhibit a range 
of within-couple diversity that is similar to that of the reference sequences (highlighted in blue and denoted by triangles), indicating that they are 
epidemiologically unlinked. Two small brackets denote viral subclusters, each involving viruses from two sets of couples (see text for details). 
Brackets on the far right indicate major group M sequence subtypes. 



404 NOTES 



J. Virol. 



assessment of HIV-i transmission, such as studies aimed at 
identifying host and viral transmission correlates or determin- 
ing the effectiveness of certain prevention strategies, the mo- 
lecular characterization of viruses from both partners is essen- 
tial. 
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The Ability of an Oligomeric Human Immunodeficiency Virus 
Type 1 (HIV-1) Envelope Antigen To Elicit Neutralizing 
Antibodies against Primary HIV-1 Isolates Is Improved 

following Partial Deletion of the Second Hypervariable Region 
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Partial deletion of the second hypervariable region from the envelope of the primary-like SF162 virus 
increases the exposure of certain neutralization epitopes and renders the virus, SF162AV2, highly susceptible 
to neutralization by clade B and non-clade B human immunodeficiency virus (HIV-positive) sera (L. Stama- 
tatos and C. Cheng-Mayer, J. Virol. 78:7840-7845, 1998). This observation led us to propose that the modified, 
SF162AV2-derived envelope may elicit higher titers of cross-reactive neutralizing antibodies than the unmod- 
ified SF162-derived envelope. To test this hypothesis, we immunized rabbits and rhesus macaques with the 
gpl40 form of these two envelopes. In rabbits, both immunogens elicited similar titers of binding antibodies but 
the modified immunogen was more effective in eliciting neutralizing antibodies, not only against the SF162AV2 
and SF162 viruses but also against several heterologous primary HIV type 1 (HIV-1) isolates. In rhesus 
macaques both immunogens elicited potent binding antibodies, but again the modified immunogen was more 
effective in eliciting the generation of neutralizing antibodies against the SF162AV2 and SF162 viruses. 
Antibodies capable of neutralizing several, but not all, heterologous primary HIV-1 isolates tested were elicited 
only in macaques immunized with the modified immunogen. The efficiency of neutralization of these heterol- 
ogous isolates was lower than that recorded against the SF162 isolate. Our results strongly suggest that 
although soluble oligomeric envelope subunit vaccines may elicit neutralizing antibody responses against 
heterologous primary HIV-1 isolates, these responses will not be broad and potent unless specific modifications 
are introduced to increase the exposure of conserved neutralization epitopes. 



Analysis of the crystal structure of the gpl20 human immu- 
nodeficiency virus (HIV) envelope subunit indicated that neu- 
tralization epitopes are primarily clustered in one face of this 
protein, which is naturally occluded within the oligomeric en- 
velope form, i.e., that present on the surface of virions and 
infected cells (16, 37). These structural observations are sup- 
ported by numerous immunochemical and virological studies 
(1, 24, 25, 27, 28, 31, 35, 38, 40). 

Several reports have indicated that specific modifications 
(such as deglycosylations and loop deletions) introduced in the 
envelope glycoproteins of HIV and simian immunodeficiency 
virus (SIV) may increase the exposure of neutralization 
epitopes. Wyatt et al. demonstrated that on the background of 
the HXB2 virus, a laboratory-adapted CXCR4-using (X4-us- 
ing) virus, deletions of the first, second, and third hypervariable 
regions (VI, V2, and V3 loops, respectively) of the gpl20 
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envelope subunit increase the exposure of epitopes participat- 
ing in HIV envelope-CD4 and -coreceptor binding (38, 40). 
Subsequently, it was demonstrated that the simultaneous de- 
letion of the VI and V2 loops from the envelope of this virus 
increases it susceptibility to neutralization by anti-V3 loop and 
certain CD4-induced monoclonal antibodies (MAbs) (3). Reit- 
ter et al. reported that elimination of specific asparagine-linked 
glycosylation sites located in the VI loop of SIVmac239 results 
in the exposure of neutralization epitopes and, importantly, 
increases their immunogenicity (25). Infection of macaques 
with SI Vmac239-derived viruses expressing such partially de- 
glycosylated envelopes results in the generation of antien- 
velope antibodies capable of neutralizing the parental virus 
SIVmac239, which displays a fully glycosylated envelope, more 
efficiently than antibodies elicited during infection of ma- 
caques with SIVmac239 itself. 

We previously reported that on the background of the SF162 
virus, a primary-like CCR5-using (R5-using) isolate, deletion 
of the 30 amino acids from the central region of the V2 loop 
(SF162AV2) does not abrogate its infectivity but renders it 
highly susceptible to neutralization by sera collected from pa- 
tients infected with heterologous HIV type 1 (HIV-1) isolates 
(30). We hypothesized that on the background of the SF162 
envelope, partial elimination of the V2 loop increases the 
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exposure of neutralization epitopes that are conserved among 
heterologous primary HIV-1 isolates. 

In this study, we compared the immunogenic potentials of 
the unmodified SF162 and modified SF162AV2 (hereafter des- 
ignated AV2) envelopes. Using the gene gun vaccination 
method, we immunized rabbits with the gpl40 form of the 
SF162 and AV2 envelopes. We observed that both immuno- 
gens elicited the generation of similar antibody titers, but that 
the modified immunogen elicited higher titers of neutralizing 
antibodies against the parental SF162 virus than the unmodi- 
fied immunogen. These results are in agreement with those 
previously reported in the case of SIVmac239 (25), because 
they suggest that specifically modified envelope immunogens 
are more effective than the corresponding unmodified enve- 
lope immunogens in eliciting neutralizing antibodies against 
the homologous parental virus: Additionally, the AV2-derived 
modified immunogen was more effective than the SF162-de- 
rived unmodified immunogen in generating antibodies capable 
of neutralizing heterologous primary HIV-1 isolates. 

The immunogenicity of these two antigens was also evalu- 
ated in rhesus macaques, an animal model more closely related 
to humans and more suitable for HIV vaccine studies, using 
the DNA-prime-protein-boost vaccination method. Here too 
we recorded that the modified immunogen was more effective 
than the unmodified immunogen in generating potent neutral- 
izing antibodies both against the homologous SF162AV2 and 
parental SF162 viruses. The antibodies elicited in macaques by 
the modified, but not unmodified, immunogen neutralized sev- 
eral, but not all, heterologous primary HIV-1 isolates. The 
neutralizing potential against the heterologous isolates tested 
was lower than that against the parental SF162 virus. Previous 
studies reported that cross-reactive neutralizing antibodies 
against primary HIV-1 isolates could be elicited in mice im- 
munized with fusion-competent vaccines (18) or soluble oligo- 
meric envelopes derived from a primary-like HIV envelope 
(41). Our studies indicate for the first time that potent cross- 
reactive neutralizing antibodies can be elicited in nonhuman 
primates immunized with soluble oligomeric subunit HIV en- 
velope vaccines derived from an R5-using primary-like HIV-1 
isolate. They strongly suggest, however, that specific envelope 
modifications can be introduced to increase the exposure of 
neutralization epitopes and increase the breadth and potency 
of these responses. 

MATERIALS AND METHODS 

Viruses. The isolation and phenotypic characterization of the SFI62 and 
SF162AV2 isolates were previously reported (5, 30). The primary clade B HIV-1 
isolates 92US660, 92HT593, 92US657, 92US714, 92US727, 91US056, 91US054 
and 93US073 were obtained from the NIH AIDS Research and Reference 
Reagent Program. All viral stocks were prepared and titrated in activated human 
peripheral blood mononuclear cells (PBMC). 

Vaccines. The DNA vector used to express our immunogens in rabbits is the 
pJW4303 (20). The DNA vector used to immunize rhesus macaques is derived 
from pCMVKm2 (4, 43). Both DNA piasmids contain the human cytomegalo- 
virus enhancer/promoter elements, and the native leader peptide of the HIV 
envelope was replaced with that derived from the tissue-specific plasminogen 
activator gene. In the case of macaque immunizations, the DNA construct was 
codon optimized for high expression in mammalian cells. Both DNA vectors 
express the gpI40 ectodomain form of the HIV envelope immunogen, with an 
intact gpl20-gp41 cleavage site. 

Protein-boosting immunizations were performed only in rhesus macaques to 
increase the titer of antibodies elicited following the DNA phase of immuniza- 
tion. For this purpose, the AV2 gp!40 protein was produced in CHO cells and 



purified as stable soluble trimers (I. Srivastava et al., unpublished data). To 
increase the stability of these secreted oligomers, the gpl20-gp41 cleavage site 
was eliminated by mutagenesis (9, 10, 32). 

Immunizations (i) Rabbits. Each animals received five DNA immunizations 
(each immunization consisting of 36 shots of 0.5 jig of DNA each) by the gene 
gun vaccination method (20) at weeks 0, 4, 8, 18, and 22. Blood was drawn 2 
weeks following each immunization. Six animals (Al to A6) were immunized 
with the unmodified SF162 gpl40 immunogen, and six animals (A7 to A12) 
received the modified AV2 gpl40 immunogen. Two animals (A13 and A14) 
served as controls and were immunized with the DNA vector alone. 

(ii) Rhesus macaques. Animals H445 and J408 were immunized with the 
modified AV2 gpl40 immunogen, animals N472 and P655 were immunized with 
the unmodified SF162 gp!40 immunogen, and animals M844 and H473 were 
immunized with the DNA vector alone. Before immunization, the animals were 
tested for antibodies to various simian viruses such as SIV, type D retroviruses, 
and simian T-lymphocytic virus type 1. Animals vaccinated with the modified 
envelope were immunized with DNA at weeks 0, 4, and 8, and animals vaccinated 
with the unmodified envelope were immunized with DNA at weeks 0, 4, and 9. 
The DNA (2 mg of DNA in 1 ml of endotoxin-free water each time per animal) 
was administered both intradermally at two sites (0.2 mg at each site) and 
intramuscularly (0.8 mg at two sites in the quadriceps muscles). Animals were 
immunized a fourth time with DNA and at the same time with the purified 
oligomeric AV2 or SF162 gpl40 protein mixed with the adjuvant MF-59C. The 
proteins (0.1 mg of purified protein in 0.5 ml [total volume] per animal) were 
administered intramuscularly in the deltoids. The control animals received only 
adjuvant. This DNA-plus-protein booster immunization took place at week 27 
for animals vaccinated with the modified immunogen and at week 48 for animals 
immunized with the unmodified immunogen. At week 38, the animals immu- 
nized with the modified, but not those immunized with the unmodified, immuno- 
gen were immunized one additional time with the adjuvanted protein alone (no 
DNA). 

Antibody determination (i) Anti-gpl40 antibodies. Titers were determined 
throughout the immunization protocol by enzyme-linked immunosorbent assay 
(ELISA) as previously described (31, 33). Briefly, purified soluble oligomeric 
AV2 and SF162 gpl40 proteins were used to coat ELISA plates (Immulon 2HB) 
(0.2 u-g of protein in 0.1 ml of 100 mM NaHC03 [pH 8.5]) by overnight 
incubation at 4°C. Nonadsorbed protein molecules were removed by washing 
with Tris-buffered saline (TBS), and the wells were blocked with SuperBlock 
(SB; Pierce). Heat-inactivated (56°C for 35 min) sera collected from the immu- 
nized animals were serially diluted in SB and added to the wells (0.1 ml per well) 
for 1 h at 37°C. In the case of rabbits, sera from control animals receiving the 
DNA vector alone were used as negative controls. In the case of macaques, 
preimmunization sera were used as negative controls. Unbound antibodies were 
removed by TBS washing, and the envelope-bound antibodies were detected with 
the use of goat anti-human (in the case of rhesus sera) or anti-rabbit (in the case 
of rabbit sera) immunoglobulin G coupled to alkaline phosphatase antibodies 
(Zymed Immunochemicals) as previously described (31). The optical density at 
490 nm (OD 490 ) of each well was recorded with a Biolummometer (Molecular 
Dynamics). A plot of the OD 490 signals versus serum dilution was generated, and 
endpoint antibody titers were determined as the highest postimmunization se- 
rum dilution that produces an OD 4W value three times that of the OD 490 
produced by the preimmunization sera at their lowest dilution. Sera from various 
stages of immunization were tested at the same time. 

• (ii) Anti-V3 loop antibodies. Titers of anti-V3 loop antibodies generated dur- 
ing immunization were determined by ELISA using the peptide CKSITIGPG 
RAFYATGDC, derived from the central region of the SF162-derived V3 loop. 
This peptide was diluted in 0.2 M sodium bicarbonate (pH 9.4) at a concentra- 
tion of 1 u,g/ml and then used to coat ELISA plates (0.1 ml per well) by overnight 
incubation at 37°C. The wells were washed with TBS and blocked with SB as 
described above. Sera collected before and after immunization were serially 
diluted in SB containing 03% (vol/vol), Tween and added (0.1 ml per well) for 
2 h at room temperature. The plates were washed with TBS containing 0.3% 
(vol/vol) Tween, and V3 loop-bound antibodies were detected as described 
above with the use of immunoglobulin G coupled to alkaline phosphatase anti- 
bodies of the appropriate species. The antibody titers were determined as de- 
scribed above. 

Neutralization assays. Neutralization assays were performed using as target 
cells human PBMC activated for 3 days with phytohemagglutinin (3 p-g/ml; 
Sigma) as previously described (21, 22, 30, 34). All HIV-1 isolates tested were 
grown and titrated in human PBMC, aliquoted, and kept frozen at -80°C until 
further use. Viruses (50 to 100 50% tissue culture infective doses in 50 p.1 of 
complete RPMI medium containing 20 U of interleukin-2 [Hoffmann-La Roche] 
per ml) were preincubated with an equal volume of serially diluted heal-inacii- 
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vated (35 min at 56°C) sera for 1 h at 37°C in 96-well U-bottom plates (Coming). 
For each serum dilution, triplicate wells were used. Preimmunization sera from 
macaques and sera collected from rabbits immunized with the DNA vector alone 
were also incubated with the viruses and sewed as controls for nonspecific 
neutralization. To each well, 0.1 ml of complete medium containing 0.4 X 10 6 
phytohemagglutinin-activated PBMC was added. Following overnight incubation 
at 37°C, half of the volume of each well was replaced with fresh, complete RPMI 
medium. Following centrifugation of the plates (5 min at 2,000 rpm), half of the 
volume of each well was again replaced with fresh medium. This procedure was 
repeated twice. The p24 antigen concentration in each well was evaluated at 
various points following infection (usually at days 4, 6, and 11), using an in-house 
EL1SA p24- detection assay. The mean percent neutralization from triplicate 
wells and the standard deviation for each serum dilution were calculated based 
on p24 concentrations recorded in wells containing virus, cells, and no rabbit or 
macaque serum as previously described (34). However, we noticed that infection 
of some isolates was reduced in the presence of preimmunization sera (nonspe- 
cific neutralization). We decided therefore to present the results from our neu- 
tralization studies in two ways: (i) in the same figure, we present both the 
neutralization curve recorded with sera collected prior to vaccination (prebleeds) 
and that recorded with sera collected at various stages following vaccination; (ii) 
for each serum dilution, we calculate the difference between the percent neu- 
tralization recorded with postvaccination sera minus that recorded with prevac- 
cination sera. Jn some figures, this difference (which we term specific neutral- 
ization) is plotted as a function of serum dilution. In parallel, we evaluated the 
susceptibilities of the various primary isolates to neutralization by MAbs 2F5 and 
2G12. 

During these neutralization experiments, we also evaluated the abilities of sera 
collected from macaques immunized with the recombinant SF2 gpl20 envelope. 
This immunogen was previously tested as a potential vaccine against HIV and 
failed to raise cross-reactive neutralizing antibodies (22). 

RESULTS 

Generation of antibodies in rabbits. Both the SF162- and 
SF162AV2-derived immunogens elicited high titers of antibod- 
ies capable of binding to both oligomeric AV2 and SF162 
gpl40 (Fig. 1). As expected, variantions in the antibody titers 
were recorded throughout the vaccination schedule in animals 
belonging to either group. However, no statistically significant 
differences in antibody titers were recorded between the two 
animal groups throughout the immunization schedule. The 
antibody titers in each animal, regardless of whether it was 
immunized with the modified or the unmodified immunogen, 
were very weak during the first two immunizations (at 0 and 4 
weeks). The fourth immunization (at 18 weeks) resulted in an 
increase in antibody titers, compared to the third immuniza- 
tion (8 weeks), between 2 and 3 Iog lo in both animal groups. 
The fifth immunization (22 weeks) increased the antibody ti- 
ters, compared to the fourth immunization, against the SF162 
gpl40 antigen (by less than 1 log 10 ) but not against the AV2 
gpl40 protein. At the end of the vaccination schedule, very 
potent endpoint ELISA binding antibody titers in the order of 
10 5 to 10 6 were recorded in both animal groups against both 
antigens. Thus, it appears that in rabbits, based on the assay 
used here to determine antibody titers, the modified immuno- 
gen is as effective as the unmodified immunogen in eliciting the 
generation of antibodies even though the former immunogen 
lacks 30 amino acids from the V2 loop. 

Neutralizing activity in rabbit sera against the SF162 and 
SF162AV2 isolates. Both immunogens generated neutralizing 
antibodies against the SF162AV2 virus following the third 
DNA immunization (Fig. 2A). A trend toward higher neutral- 
ization titers in the modified immunogen-vaccinated group was 
recorded. Thus, the mean (± standard error) serum dilutions 
at which 70% inhibition of infection was recorded for SF162 



gpl40- and AV2 gpl40-immunized animals were 179 (±34) 
and 483 (±148), respectively. At this stage of vaccination, 
while two (A8 and A9) out of sue animals immunized with the 
modified immunogen elicited neutralizing antibodies against 
the parental SF162 isolate, none of the animals immunized 
with the unmodified immunogen elicited antibodies capable of 
doing so (Fig. 2B). However, the number of animals that gen- 
erated neutralizing antibodies against the SF162 and 
SF162AV2 viruses increased with each subsequent immuniza- 
tion, so that at the end of the immunization schedule (i.e., after 
the fifth immunization) all animals had generated neutralizing 
antibodies against the SF162 virus. In addition, the neutraliz- 
ing potency of each serum, regardless of whether the animal 
was vaccinated with the modified or unmodified immunogen, 
increased with each immunization. 

At the end of the immunization schedule, sera collected 
from rabbits immunized with the modified immunogen had 
higher neutralization potency against both SF162AV2 and 
SF162 viruses than sera collected from animals immunized 
with the unmodified immunogen. Six out of six animals immu- 
nized with the modified immunogen elicited antibodies capa- 
ble of neutralizing the SF162AV2 virus between 70 and 100% 
at a 1:5,000 dilution (Fig. 2A). In contrast, at the same serum 
dilution only one (Al) of the six animals vaccinated with the 
unmodified envelope developed antibody responses able to 
neutralize SF162AV2 infection, and that by only 50%. The 
remaining five animals in this group failed to elicit antibody 
responses potent enough to neutralize SF162AV2 infection to 
any significant extent at this dilution. Differences in neutraliz- 
ing potential between sera collected from animals immunized 
with the modified immunogen and those immunized with the 
unmodified immunogen were also evident when their abilities 
to neutralize the SF162 virus were compared (Fig. 2B). Sera 
collected from four (A8, A9, A10, and A12) out of six animals 
immunized with the modified antigen neutralized SF162 infec- 
tion between 70 and 90% at 1:100 to 1:300 dilutions. In con- 
trast, none of the sera collected from animals immunized with 
the unmodified antigen could inhibit SF162 infection by 70 to 
90% at the same dilutions. 

Generation of cross-reactive neutralizing antibodies in rab- 
bits. The fact that the SF162AV2-derived envelope immuno- 
gen was capable of eliciting higher titers of neutralizing anti- 
bodies against the parental SF162 isolate (which expresses the 
full envelope) than the immunogen derived from the SF162 
isolate itself prompted us to examine whether the modified 
immunogen was also more effective in eliciting cross-reactive 
neutralizing antibodies, i.e., antibodies capable of neutralizing 
heterologous to the vaccine primary HIV-1 isolates. We tested 
several such isolates whose neutralization susceptibility to var- 
ious MAbs was previously documented (8). Only two 
(92US714 and the 92HT593) out of the six isolates examined 
were neutralized by antibodies elicited by the unmodified im- 
munogen (Table 1). All animals except animal Al developed 
neutralizing antibodies against 92US714, while only animals 
A2 and A5 generated neutralizing antibodies against 92HT593. 
In contrast, four out of the six animals immunized with the 
modified AV2 gpl20 immunogen generated cross-reactive neu- 
tralizing antibodies against most of the heterologous isolates 
tested. In addition, the neutralization potency of sera collected 
from animals immunized with the modified immunogen was 
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FIG. 1. Development of antibodies in rabbits. Six animals (Al to A6) were immunized with DNA expressing the unmodified SF162 gpl40 
immunogen, and six (A7 to A12) were immunized with DNA expressing the modified AV2 gpl40 immunogen. Titers were determined 2 weeks 
following each immunization, as described in Materials and Methods, against the oligomeric SF162 and AV2 gpl40 proteins. Dashed lines indicate 
times of immunizations. 



higher than that of sera collected from animals immunized 
with the unmodified immunogen (Table 1). Thus, although 
80% inhibition of infection was frequently recorded with the 
former sera, this level of inhibition was recorded in only two 
instances (sera from animal A5 versus the 92US714 and 
92HT593 isolates). 

Development of antibodies in rhesus macaques vaccinated 
with the modified AV2 gpl40 immunogen. The above results 
prompted us to evaluate the immunogenic potential of the 
unmodified SF162 gpl40 and modified AV2 gpl40 antigens in 
rhesus macaques, an animal model where the protective po- 



tential of vaccine-elicited antibodies can eventually be evalu- 
ated. Macaques were vaccinated with these two immunogens 
by the DNA-prime-protein-boost vaccination method. 

Envelope-specific antibodies became detectable following 
the second DNA immunization (Fig. 3). At this stage, endpoint 
ELISA titers in animals immunized with the modified antigen 
(animals J408 and H445) were in the order of 1:2,000. In 
contrast, in animals immunized with the unmodified envelope 
(animals N472 and P655), antibodies were detectable only in 
animal N472 (endpoint ELISA titers in the order of 1:500). 
With the exception of animal H445, the third DNA immuni- 
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FIG 2 Results of neutralization experiments using rabbit sera collected following the third and fifth immunizations against the SF162A V2 (A) 
and SF162 (B) viruses Data are representative of at least three independent experiments. Symbols indicate the mean percent neutralization and 
standard deviation from triplicate wells. Dashed lines indicate 50, 70, and 90% inhibition of infection. Asterisks (controls) represent neutralization 
curves obtained with sera collected from animals that were immunized with the DNA vector alone and are indicative of nonspecific neutralization. 



zation did not further increase the antibody titers. Anti-gpl20 
and anti-gp41 antibodies were generated synchronously during 
DNA immunization (data not shown). 

During the subsequent 5 to 10 months of observation, anti- 
bodies were undetectable in animals immunized with the un- 
modified SF162 gpl40 immunogen, while in animals immu- 
nized with the modified AV2 gpl40 immunogen the antibodies 
were always detectable, but their titers declined over time. 

Following the DNA-plus-protein booster immunization, the 
antibody titers increased significantly in all animals. At their 
peak values (reached within 2 to 4 weeks postboosting), end- 
point ELISA antibody titers in animals immunized with the 
modified AV2 gpl40 immunogen were 1:30,000 for animal 
J408 and 1:110,000 for animal H445. The titers decreased 
gradually over time and remained stable at approximately 



1:8,000 for several weeks in both animals. Higher peak anti- 
body titers were recorded in animals vaccinated with the un- 
modified SF162 gpl40 immunogen (endpoint ELISA antibody 
titers of 1:150,000 in animal N472 and 175,000 in animal P655). 
During the following 7 weeks of observation, the antibody 
titers decreased more rapidly in both animals to approximately 
1:35,000. Thus, in contrast to what we recorded in rabbits, in 
macaques the unmodified immunogen generated higher titers 
of binding antibodies than the modified immunogen. 

As expected, anti-HIV envelope antibodies were not gener- 
ated in control animals (M844 and H473) immunized with the 
DNA vector alone. 

Neutralizing activity of macaque sera against the homolo- 
gous SF162AV2 and parental SF162 isolates. During the DNA 
phase of immunization, only animals immunized with the mod- 
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ified AV2 gpl40 immunogen elicited neutralizing antibodies 
against the SF162 and SF162AV2 viruses (Fig. 4). Following 
the second DNA immunization, animal J408 developed neu- 
tralizing antibodies against the homologous SF162AV2, but 
not the parental SF162, isolate (Fig. 4A). The titer of neutral- 
izing antibodies in animal J408 increased following the third 
DNA immunization, at which point neutralization of both iso- 
lates was recorded, although the titers of binding antibodies 
did not increase in parallel (Fig. 3). In contrast, much weaker 
neutralizing antibody responses against the SF162AV2 and no 
neutralizing responses against the SF162 virus were elicited in 
animal H445, even though this animal generated titers of bind- 
ing antibodies similar to those generated in animal J408 (Fig. 3). 

Two weeks following the DNA-plus-protein booster immu- 
nization, sera collected from animals immunized with either 
immunogen inhibited SF162AV2 infection. The neutralization 
potency of sera collected from animals immunized with the 
modified immunogen was higher than that of sera collected 



from animals immunized with the unmodified immunogen. For 
example, 50% inhibition of SF162AV2 infection was recorded 
at dilutions of 1:2,000 to 1:5,000 from the former sera, but this 
level of inhibition was not recorded at this dilutions with the 
latter sera. Both AV2 gpl40-immunized animals generated 
strong neutralizing antibodies against the parental SF162- virus, 
while only one (N472) of the two animals immunized with the 
SF162 gpl40 immunogen generated neutralizing antibodies 
against this virus. Changes in the neutralizing potency of these 
sera were not recorded during the subsequent 2 weeks, even 
tough changes in the antibody titer levels were detectable dur- 
ing this period (Fig. 3). Control animals (M844 and H473) 
vaccinated with the vector alone did not develop neutralizing 
antibodies (data not shown). • 

Neutralization of heterologous primary HIV-1 isolates by 
macaques sera. The breath of the neutralizing antibody re- 
sponses elicited in macaques immunized with the modified and 
unmodified immunogens was evaluated by comparing the abil- 
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TABLE 1. Generation of cross- reactive neutralizing antibodies in rabbits 
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a Neutralizing activity was evaluated at 1:10 dilution, taking into consideration the nonspecific neutralization recorded with sera collected from animals vaccinated 
with the DNA vector alone (see Materials and Methods for details). -, 50% specific neutralization was not recorded; +, 50 or 80% specific neutralization was recorded. 
Results are from three independent neutralization experiments. 



ities of sera collected from macaques immunized with these 
two immunogens to block infection of heterologous primary 
clade B HIV-1 isolates. During our serum neutralization ex- 
periments, we evaluated in parallel the susceptibility of these 
isolates to neutralization by two of the most commonly used 
primary-isolate-neutralizing MAbs (2F5 and 2G12) (Table 2). 

Heterologous isolate neutralization was not recorded (less 
than 50% inhibition of infection at 1:10 serum dilution) during 
the DNA phase of immunization in macaques (data not 
shown). Two weeks following the DNA-plus-protein booster 
immunization, sera collected from the two animals vaccinated 
with the modified AV2 gpl40 protein neutralized some of the 
heterologous primary HIV-1 isolates tested (Fig. 5). At the 
lowest serum dilution tested (1:10), and when nonspecific neu- 
tralization recorded with preimmunization sera was taken into 
consideration (see Materials and Methods for details), 80 to 
90% inhibition of infection was recorded only with the ADA, 
91US056, and 92US714 isolates by J408 sera and with the 
ADA, 92US714, and 92US660 isolates with the H445 sera (Fig. 
5 and Table 2). The cross-neutralizing activity of the sera 
collected from these two animals differed. For example, 
92US660 infection was inhibited by 80 and 50% by H445 and 
J408 sera, respectively. The serum cross-neutralizing activity 
decreased during the subsequent weeks of observation (Fig. 5). 
Sera collected 5 weeks following this DNA-plus-protein 
booster immunization had no cross-reactive neutralizing activ- 
ity, even though potent neutralization of the SF162 and 
SF162AV2 isolates was still recorded. 

Despite the fact that following this DNA-plus-protein 
booster immunization, the binding antibody titers in animals 
vaccinated with the unmodified immunogen were higher than 
those in animals vaccinated with the modified immunogen 
(Fig. 3), the former sera failed to neutralize any of the heter- 
ologous isolates tested (Table 2) (i.e., less than 50% specific 
neutralization was recorded). Thus, although in rabbits the 
unmodified immunogen was able to elicit (albeit much less 
efficiently than the modified immunogen) neutralizing antibod- 
ies against some heterologous primary HIV-1 isolates (Table 
1), it failed to do so in rhesus macaques. 

In parallel, we evaluated the susceptibility of the heterolo- 



gous isolates to neutralization by sera collected from macaques 
that had been immunized with the recombinant SF2-derived 
gpl20 protein. This protein was previously evaluated as a vac- 
cine candidate and was ineffective in eliciting cross-reactive 
neutralizing antibodies; i.e., less than 50% neutralization at 
serum dilutions of 1:10 was recorded (22). All of the isolates 
tested here were not susceptible to neutralization by antibodies 
elicited by the SF2 gpl20 protein (Table 2). 

Second booster immunization with the modified AV2 gpl40 
protein. Although the above results indicated that the modified 
AV2 gpl40 immunogen was indeed more effective in eliciting 
cross-reactive neutralizing antibody responses than the un- 
modified immunogen, these responses were weaker than those 
recorded against the parental SF162 isolate (Fig. 5). In an 
effort to further increase the potency and breath of these 
responses, we attempted to further boost the antibody titers in 
animals H445 and J408 by immunizing them one additional 
time with the purified oligomeric AV2 gpl40 protein (this time 
in the absence of DNA immunization). 

An increase in antibody titers was indeed recorded following 
this protein boost, so that at their peak values (1:145,000 end- 
point ELISA titers) the. titers were approximately threefold 
higher than those recorded during the first booster immuniza- 
tion with DNA plus protein (Fig. 6A). In parallel, we recorded 
a significant increase in the titer of neutralizing antibodies 
against the homologous SF162AV2 and parental SF162 iso- 
lates (Fig. 6B). No differences in the neutralizing potential of 
the sera collected 2 and 5 weeks following this last boost were 
recorded, even though the binding antibody titers decreased 
significantly during the same period. Unexpectedly, however, 
the neutralizing potential of the same sera against most of the 
heterologous primary isolates tested generally decreased (Ta- 
ble 2). Thus, with the exception of the BZ167, 92US657, and 
ADA isolates, all of the heterologous isolates tested were re- 
sistant to neutralization by sera collected 2 weeks following the 
second boost. Interestingly, although isolate 92US657 was re- 
sistant to neutralization by sera collected following the first 
boost, it became susceptible to neutralization by sera collected 
from animal H445 following the second boost. 
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FIG 3 Generation of antibodies in rhesus macaques. The generation of antienvelope antibodies in two animals (J408 and H445) immunized 
with the modified AV2gpl40 immunogen and two animals (P655 and N472) immunized with the unmodified SF162gpl 40 immunogen, as well as 
control animals (M844 and H473) immunized with the DNA vector alone, were determined by ELISA using the corresponding protem as described 
in Materials and Methods. Dashed lines indicate times of immunizations. DNA, animals received three monthly immunizations with DNA vectors 
expressing the gpl40 form of each immunogen. Control animals received the DNA vector alone. DNA plus Protein animals received a fourth DNA 
immunization and at the same time were immunized with the corresponding CHO cell-produced oligomers gpl40 proteins, mixed in the adjuvant 
MF-59C. Control animals received adjuvant alone. 



Generation of anti-V3 loop antibodies in rhesus macaques 
vaccinated with the modified AV2 gpl40 immunogen. One 

explanation for the increase in neutralizing activity against the 
parental SF162 and homologous SF162AV2 viruses and the 
decrease in neutralizing activity against the heterologous iso- 
lates following the second booster immunization is that multi- 
ple immunizations with the modified AV2 gpHO protein in- 
creased the titer of antibodies directed against epitopes that 
are uniquely (or predominantly) expressed on the SF162 and 
SF162AV2 envelopes. It is conceivable that multiple immuni- 
zations with the AV2 gpl40 protein result in the generation of 
high titers of anti-V3 loop antibodies. To determine the titers 
of such antibodies, we used V3 loop peptide-based ELISAs 
using the SF162/SF162A V2-derived V3 loop (Fig. 7). This pep- 
tide was recognized by antibodies binding to both linear 
(447D) (7, 12) and conformational (391-95D) (29) epitopes 
(Fig. 7A). Although anti-V3 loop antibodies were generated 
upon immunization of macaques with the modified AV2 gpl40 



immunogen, their titers were much lower than those against 
the entire envelope (Fig. 7B). In addition, the second booster 
immunization did not increase the titer of anti-V3 loop anti- 
bodies. It should be noted, however, that certain anti-V3 loop 
antibodies present in the sera of these animals may not interact 
efficiently with the V3 loop peptide in an ELISA format, while 
they may bind to their epitopes on the native envelope (23). 
Additionally, the V3 loop peptide used here does not span the 
carboxy and amino termini of the V3 loop, and our assay does 
not detect antibodies targeting these two regions. Thus, a more 
detailed examination of the epitope specificity of the antibod- 
ies elicited by the modified AV2 gpl40 immunogen is required. 

DISCUSSION 

In this study we compared the immunogen icities of soluble 
oligomeric gpl40 envelope proteins derived from related neu- 
tralization-resistant (SF162) and neutralization-susceptible 
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FIG 4 Neutralizing activities of rhesus macaque sera. Neutralization activities against the SF162 and SF162AV2 viruses of sera collected from 
animals immunized with the modified AV2 gpl40 (A) and the unmodified (B) SF162 gp!40 immunogens were determined as described in Materials 
and Methods Dashed lines indicate 50, 70, and 90% inhibition of infection. Results are representative of three to five independent experiments. 
Data indicate the mean and standard deviation from triplicate wells. Pre-bleeds, sera collected before vaccination; 2nd DNA and 3rd DNA, sera 
collected 1 month following the second and the third, respectively, DNA administrations; 2 and 4 weeks post 1st boost, sera collected 2 and 4 weeks, 
respectively, following the DNA-plus-protein booster immunization. 



(SF162AV2) viruses (30). The only difference between the two 
immunogens is the absence of 30 amino acids from the V2 loop 
of the SF162AV2-derived immunogen (30). 

We first performed immunization studies in rabbits, where 
we observed that although both proteins elicited similar titers 
of binding antibodies (Fig. 1), the modified immunogen elic- 
ited higher titers of neutralizing antibodies against isolates 
expressing not only the modified SF162AV2 envelope but also 
the unmodified parental SF162 envelope (Fig. 2). Our results 
are in agreement with those made on the background of the 
SIVmac239 virus (25) because they suggest that specifically 
modified envelope immunogens may elicit antibodies capable 
of neutralizing isolates expressing the parental unmodified en- 
velope. However, our results contrast with those obtained dur- 
ing the immunization of rabbits with a more extensively mod- 
ified HIV-1 envelope-derived immunogen, which lacked not 
only the V2 loop but also the VI and V3 loops (20). The 
reasons for this discrepancy are not known, but it would be 
interesting to determine whether the triple-loop-deleted SF162 
envelope also fails to elicit neutralizing antibodies. 



In rabbits, both the unmodified SF162 and modified AV2 
gpl40 immunogens elicited neutralizing antibodies against sev- 
eral heterologous primary HIV-1 isolates, but the potential of 
the modified immunogen to do so was greater (Table 1). Thus, 
not only did more animals vaccinated with the modified im- 
munogen elicit cross-reactive neutralizing antibodies, but also 
the breadth and potency of the cross-neutralizing responses 
were higher in sera collected from these animals than from 
animals immunized with the unmodified immunogen. We be- 
lieve that this is a consequence of the greater immunogenicity 
of conserved neutralizing epitopes on the modified immuno- 
gen tested here. Consequently, the modified immunogen more 
effectively elicits antibodies recognizing these epitopes than 
the unmodified immunogen. If V2 loop deletion increases the 
exposure of conserved neutralization epitopes that are poorly 
exposed on the SF162 envelope, then these two immunogens 
will elicit different titers of the same antibodies. If V2 loop 
deletion results in exposure of neutralization epitopes that are 
not exposed on the SF162 envelope, then the two immunogens 
will elicit different types of antibodies. An alternative explana- 
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tion for the increased ability of the modified AV2gpl40 im- 
munogen to elicit neutralizing antibodies was previously pro- 
posed by our group (30). It is possible that deletion of the V2 
loop from the SF162 envelope results in an alteration of the 



ratio of neutralization and nonneutralization epitopes on this 
envelope, so that more of the former epitopes are present on 
the SF162AV2 than on the SF162 envelope. If this hypothesis 
is correct, immunization with the modified AV2 gp!40 enve- 



TABLE 2. Neutralization of heterologous primary HIV-1 isolates by macaque sera 



Isolate 
(coreceptor usage) 



2F5 



MAb 



2G12 



% Neutralization" with immunogen* 



AV2 gp!40 



J 408 



H445 



SF162 gp!40 



P655 
(A) 



N472 
(A) 



SF2 gp!20 



L7I4 



L814 



91US056(R5) 

92US714(R5) 

92US660(R5) 

92HT593(R5X4) 

92US657(R5) 

BZ167(R5X4) 

ADA(R5) 



60 

70 

75 

75 

NT 

90 

NT 



70 

20 

70 

80 

NT 

75 

NT 



90 
85 
50 



NT 
90 



50 



65 
85 
80 



NT 
90 



65 
80 
80 



NT 
NT 



NT 
NT 



NT 
NT 



NT 
NT 



- Percent neutralization of a given HIV-1 isolate by sera (1: 10 dilution) collected from animals immunized with the modified AV2 gpl40 (J408 and H445), unmodified 
SF162 g P 140 (P655 and N472), and recombinant SF2 gpl20 (L714 and L814), calculated as described in Materials and Methods, taking into consideration the 
nonspecific neutralization recorded with sera collected from the same animals before immunization. - • ♦* * 

b A sera collected 2 weeks following the DNA-plus-protein booster immunization; B, sera collected 2 weeks following the final protein booster immunization ot 
animate J408 and H445. Values represent averages from two to three independent experiments. The suscepUbility of these isolates to neutralization by 2F5 and 2G12 
at 25 M-g of MAb per ml is also presented. — , not delected; NT, not tested. 
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FIG 5 Neutralization of heterologous clade B primary H1V-1 isolates by macaque sera. Neutralization activities of sera collected 2 and 4 weeks 
foltowing the DNA-plus-protein booster immunization against isolates heterologous to the vaccine primary HI V-l isolates were determined as 
de scribed in Materials and Methods. Dashed lines indicate 50, 70, and 90% inhibition of infection. The va ues represent specific neutralization 
whTcfis defined as the difference between the percent virus neutralization recorded with sera collected followmg vaccina ion and that recorded 
with sera prior to the initiation of vaccination. Data points indicate the mean percent specific neutralization from two independent 

experiments. 



lope would be expected to elicit a proportionally higher num- 
ber of neutralizing antibodies than immunization with the un- 
modified SF162 gpl40 envelope. The above possibilities are 
not mutually exclusive. 

Our vaccination studies conducted in rhesus macaques con- 
firm the observations made in rabbits, that the modified AV2 
gpl40 immunogen is more effective than the unmodified SF162 
gpl40 in eliciting neutralizing antibodies against isolates ex- 
pressing the parental SF162 envelope and heterologous enve- 
lopes. Neutralization of heterologous primary HIV-1 isolates 
was less efficient than neutralization of the parental SF162 
virus, which in turn was less efficiently neutralized than the 
homologous SF162AV2 virus. This suggests that the epitopes 
recognized by these cross-reactive antibodies are less accessi- 
ble on the surface of the heterologous isolates than on the 
surface of the SF162 virus and much less accessible than they 
are on the surface of the SF162AV2 virus. Based on our pre- 
vious neutralization studies with MAbs, we believe that dele- 
tion of the V2 loop from the SF162 envelope increases the 
exposure of epitopes participating in envelope-CD4 and -co- 
receptor binding (30). Such epitopes become only transiently 
exposed during the displacement of the VI and V2 loops that 
take place upon HIV envelope-CD4 binding (37-39). It is 
possible that these epitopes are normally masked within the 
oligomeric envelope structure by the V2 loop and deletion of 
this loop renders them permanently exposed. The difference in 
neutralization susceptibility between the SF162 and heterolo- 
gous isolates may be the result of different envelope glycosyl- 
ation patterns and/or different positioning of the V2 loop on 
the SF162 and heterologous isolates tested here. Additionally, 
we expect that although cross-reactive antibodies were elicited 
by this specific envelope modification, a large fraction of the 
antibodies elicited by the AV2 gpl40 immunogen are targeting 
epitopes.unique to the AV2 and SF162 envelopes. Our current 
results suggest that these unique epitopes must be located in 



envelope regions other than the V3 loop. This is also sup- 
ported by our observations that the heterologous primary 
HIV-1 isolates tested here were resistant to neutralization by 
sera collected from macaques immunized with the recombi- 
nant monomeric SF2 gp!20 protein, which primarily elicits 
anti-V3 loop antibodies (36). The lack of generation of high 
anti-V3 loop antibodies by our soluble oligomeric immunogens 
may not be due to the specific antigenic structure of these 
immunogens, because a recent report indicated that immuni- 
zation with the oligomeric envelope derived from the HXB2 
isolate also failed to elicit high titers of V3 loop-directed an- 
tibodies (11). Several heterologous primary isolates tested here 
were, however, completely resistant to neutralization by the 
antibodies elicited by the modified immunogen. Either these 
isolates may lack the epitopes recognized by the antibodies 
elicited by this immunogen or, as mentioned above, these 
epitopes may be more efficiently masked on these particular 
isolates than on the isolates susceptible to neutralization. By 
identifying the epitopes recognized by the neutralizing anti- 
bodies elicited by the AV2 gpl40 immunogen, we may be able 
to determine whether these epitopes are absent from the het- 
erologous isolates that are resistant to neutralization or 
whether they are more efficiently occluded on the surface of 
these isolates. 

The envelopes of the SF162 and SF162AV2 viruses use the 
CCR5 cellular protein to mediate virus-cell fusion in the pres- 
ence of CD4 (30). The heterologous primary isolates tested 
here used either exclusively CCR5 or both CCR5 and CXCR4 
(Table 2). We did not examine whether the antibodies gener- 
ated by the immunogens tested here, especially the modified 
one, would have neutralization potential against isolates that 
exclusively utilize CXCR4 to infect CD4 + T cells. Failure of 
our immunogens to do so would be 'an indication that the 
antibodies elicited may recognize epitopes participating in 
gpl20-CCR5 interaction. Although, we believe that an effec- 
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tive vaccine against HIV should elicit a broad spectrum of 
neutralizing antibodies, such a vaccine should primarily elicit 
neutralizing antibodies against R5-using HIV-1 isolates be- 
cause such isolates are more effective than X4-using HIV-1 
isolates in establishing a primary HIV-1 infection in exposed 
humans (15, 19, 26, 42). 

Some differences were recorded during the immunization of 
rabbits and macaques. For example, in rabbits both immuno- 
gens elicited similar binding antibody titers, while the unmod- 
ified immunogen elicited higher titers of binding antibodies in 
macaques. Also, although in rabbits the unmodified immuno- 
gen elicited neutralizing antibodies (albeit infrequently and at 
low titers) against a few heterologous HIV-1 isolates, it failed 
to do so in macaques. Finally, some isolates, such as 92HT593 
was neutralized by rabbit but not by macaque sera. Whether 
these differences are due to differences in the way the immune 
systems of these two animal species reacts to the same im- 
munogen, or whether they are due to the different methods of 
immunization (gene gun in the case of rabbits and intramus- 
cular plus intradermal needle DNA injections followed by pro- 
tein immunization in the case of macaques) merits further 
investigation. 

It is important to note that in the case of the heterologous 
primary HIV-1 isolates tested here, we rarely recorded 90% 
inhibition of the infection with the neutralization assay that we 
used at the lowest serum dilution evaluated (1:10) and that the 
cross-neutralizing activity was lost within 5 weeks following the 
DNA-plus-protein booster immunization. We corrected our 
neutralization data for nonspecific neutralization recorded 
with autologous sera collected before vaccination, because it 
was recently reported that preimmunization sera (1:4 dilution) 
collected from immunized human volunteers could inhibit by 
up to 80% the infection of several primary HIV-1 isolates (2). 
Rabbit or macaque preimmunization sera (1:10 dilution) used 
here generally did not inhibit by over 30 to 35% the infection 
of the isolates tested. Although it is possible that 90% inhibi- 
tion of infection may be recorded more frequently at lower 
serum dilutions, we believe that to increase the potency and 



breadth of neutralization, additional envelope modifications 
must be introduced. These modifications should increase the 
exposure and/or the number of conserved neutralization 
epitopes on the immunogen. However, because an increase in 
epitope exposure may not automatically result in an increase of 
epitope immunogenicity (20), the effect of a particular enve- 
lope modification on envelope immunogenicity can be deter- 
mined empirically only by in vivo immunogenicity studies in 
relevant animal models. Nevertheless, the achievement of the 
observed level of cross-reactive neutralizing activity in a non- 
human primate warrants further evaluation of this particular 
envelope modification approach. Neutralization epitopes may 
also become exposed during the propagation of HIV in cells 
lacking receptor molecules (14, 17) or by thermal or chemical 
treatment of virions (13). Envelope molecules derived from 
such viruses, when used as immunogens, may also elicit high 
titers of neutralizing antibodies. 

It would also be important to compare the protective poten- 
tials of antibodies elicited by our modified and unmodified 
immunogens. In this regard, we recently reported that the 
antibodies elicited by the modified AV2gpl40 immunogen of- 
fered partial protection from challenged with the related 
pathogenic SIV-HIV himeric virus SHIV SF162P4 (6). However, 
the ability of the antibodies elicited by our modified immuno- 
gen to protect macaques from heterologous viral challenge is 
not yet known. Importantly, vaccination methodologies that 
generate and sustain significant titers of cross-reactive neutral- 
izing antibodies must be developed. 

During the second booster immunization (protein alone) of 
macaques with the modified AV2 gpl40 immunogen, we ob- 
served that although an increase in the potency of neutralizing 
antibody responses was recorded in the case of the homolo- 
gous and parental viruses (Fig. 6), a parallel increase against 
heterologous primary HIV-1 isolates was not recorded (Table 
2). Thus, multiple immunizations with the purified modified 
AV2 gpl40 protein may preferentially increase the titer of 
antibodies recognizing epitopes that are unique to the SF162 
and AV2 envelopes. Whether this is related to the fact that the 
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DNA-expressed protein has an intact gpl20-gp41 cleavage site, 
while this site is absent from the CHO cell-produced protein is 
not known. It is possible that conserved neutralization epitopes 
are more efficiently exposed on soluble oligomeric gpl40 pro- 
teins with an intact gpl20-gp41 cleavage site. In that respect, 
differences in the structures of gpl40 proteins derived from the 
SF162 and AV2 envelopes with intact or absent gp!20-gp41 
cleavage sites were reported by our group (32). 

In summary, our current data strongly suggest that unless 
specific modifications are introduced in soluble oligomeric en- 
velope constructs derived from R5-using, primary HIV-1 iso- 
lates, the breadth and potency of the neutralizing responses 
elicited by such vaccines will be weak. The challenge is to 
identify additional envelope modifications that would further 
increase the immunogenicity of neutralization epitopes whose 
structures are conserved among primary HIV-1 isolates. 
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DNA Vaccination with the Human Immunodeficiency Virus Type 1 
SF162AV2 Envelope Elicits Immune Responses That Offer Partial 
Protection from Simian/Human Immunodeficiency Virus Infection 
to CD8 4 " T-Cell-Depleted Rhesus Macaques 

S CHERPELIS, 1 I. SHRIVASTAVA, 2 A. GETTIE, 1 * 3 X. JIN, 1 D. D. HO, 1 S. W. BARNETT, 2 

and L. STAMATATOS 1 * 
Aaron Diamond AIDS Research Center, The Rockefeller University, New York, New York 10021-6399 1 ; 
Chiron Corporation, Emeryville, California 94608-2916 2 ; and Tulane Regional Primate Research Center, 

Covington, Louisiana 70433 

Received 14 July 2000/Accepted 30 October 2000 

DNA immunization of macaques with the SF162AV2 envelope elicited lymphoproliferative responses and 
potent neutralizing antibodies. The animals were depleted of their CD8 + T lymphocytes and then challenged 
intravenously with SHIV162P4. Compared to unvaccinated animals, the vaccinated macaques had lower peak 
viremia levels, rapidly cleared plasma virus, and showed delayed seroconversion. 



DNA immunization stimulates both the cellular and hu- 
moral arms of the immune system (9, 16-18) and elicits im- 
mune responses capable of preventing infection of animals by 
slowly replicating viruses, such as human immunodeficiency 
virus type 1 (HIV-1) in chimpanzees (3). However, when the 
challenge virus replicates efficiently in the host, such as simian 
immunodeficiency virus (SIV) or SIV/HIV (SHIV) in ma- 
caques, the DNA-elicited immune responses offer only par- 
tial protection (4, 10, 14). To increase the potency of these 
responses, especially the development of high anti-HIV or 
-SIV envelope antibody titers, follow-up administration of sol- 
uble viral envelope proteins, viral particles or recombinant 
vaccinia-based viruses expressing the HIV or SIV envelope is 
required (1, 2, 8, 12-14). This bimodal method of immuniza- 
tion elicits responses capable of protecting rhesus macaques 
from infection by highly replication-competent SHIV (8, 14). It 
is unclear, however, whether the recorded protection was me- 
diated by the cellular and/or humoral antiviral responses elic- 
ited during DNA immunization. By evaluating and comparing 
the respective antiviral protective roles of these two types of 
responses, we hope to develop more effective DNA immuni- 
zation protocols. 

Two rhesus macaques (H445 and J408) were immunized 
both intradermal^ and intramuscularly at weeks 0, 4, and 8 
with a DNA vector (5, 22) (2 mg of total DNA each time) 
expressing the SF162AV2 gpl40 envelope with an intact 
gpl20-gp41 cleavage site (21). The DNA construct was codon 
optimized for high expression in mammalian cells. At week 27, 
the animals were immunized one additional time with DNA 
and with the CHO-produced, purified oligomeric SF162AV2 
gpl40 protein (100 u.g) mixed with the MF-59C adjuvant. At 
week 38, the animals were immunized one additional time with 
the adjuvanted protein alone. 
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The development of binding antibodies was evaluated by 
enzyme-linked immunosorbent assay (ELISA) methodologies 
(20). Antibodies were detectable following the second DNA 
immunization, and their titers did not increase following the 
third DNA immunization (Fig. 1A). During the following 5 
months, the titers decreased gradually, but were always detect- 
able. The first "boost" increased the titers by approximately 1 
to 2 Iog 10 from the peak value recorded following the third 
DNA immunization. The titers gradually decreased and lev- 
eled off during the following 11 weeks, at which point the 
animals received a second boost, which further increased the 
antibody titers. Neutralizing antibodies (NA) were evaluated 
by using the activated peripheral blood mononuclear cell 
(PBMC) target assay (19), using preimmunization sera to cor- 
rect for nonspecific neutralization (Fig. IB). Following the 
third DNA immunization, the NA titers in animal H445 were 
lower than those in animal J408, even though the binding 
antibody titers were similar between the two animals. The NA 
titers against both SF162AV2 and SF162 increased significantly 
during the subsequent boosts. Vaccine-specific proliferative 
responses were recorded in both animals. Stimulation indices 
(Sis) of 5 and 10 were recorded following the first boost in 
animals J408 and H445, respectively. The second boost in- 
creased the potency of these responses in animal H445 (SI of 
25), but not in animal J408 (SI of 5). 

To evaluate the protective role of the anti-HIV envelope 
antibodies elicited by our vaccine, we depleted the CD8 + T 
lymphocytes from the vaccinated animals prior to viral chal- 
lenge (Fig. 2). CD8 depletion was achieved by three intrave- 
nous administrations of the anti-CD8 MAb OKT8F (2 mg/kg 
of body weight) at daily intervals (7). CD8 + T lymphocytes 
remained undetectable for approximately 10 days. Concomi- 
tantly, we recorded a decrease in the total number of circulat- 
ing CD3 + T cells. This indicates that the recorded depletion of 
CD8 + T cells from the periphery is due to their actual elimi- 
nation. Although we did not evaluate CD8 depletion from the 
lymph nodes, it was previously demonstrated that a concomi- 
tant depletion of CD8 + T cells from the periphery and lymph 
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Serum Dilution 

PIG 1 Generation of anti-HIV envelope antibodies during immunization. (A> Binding antibodies. The envelope-specific titers of binding 
antibodies in animals J408 and H445 throughout our immunization schedule were determined against the vaccine i.e. the P^d o hgc ►rnenc 
SF162AV2 epl40 protein. Dashed lines indicate the time of immunization and the arrow indicates the time of viral challenge. (B) Neutralizing 
antibodies. The presence of neutralizing antibodies against the homologous SF162AV2 virus (left panels) and the parental SF162 viruses (n^t 
Sands) was determined at various time joints during the immunization schedule: O, prebleeds; 1 month past the third DNA immunization; ■, 
2 weeks following the 1st boost; and 2 weeks following the 2nd boost. 



nodes occurs when anti-CD8 monoclonal antibodies (MAbs) 
are introduced into the blood circulation of macaques (11, 15). 

One day following the last administration of OKT8F, the 
immunized and two unimmunized naive animals were chal-' 
lenged intravenously with 100 50% tissue culture infective 
doses of a cell-free stock of the SHIV162P4 virus (6). This 
isolate was neutralized by 50 and 90% by sera (1:5 dilution) 
collected at the day of challenge from animals H445 and J408, 
respectively. 

Both vaccinated and unvaccinated animals became infected; 
however, differences in the peak viral load levels and viral set 
points were noted between the two groups (Fig. 3A). Eleven 
days postchallenge, plasma viremia in the vaccinated animal 
H445 was lower by 2 and 4 log 10 compared to that of the 
unvaccinated animals A141 and AT54, respectively, while the 
vaccinated animal J408 was aviremic. At peak viremia, plasma 
viral levels in the vaccinated animals were 1 to 4 log 10 lower 
than those in the unvaccinated animals. Following peak vire- 
mia, an initial rapid decrease followed by a more gradual 
decrease in plasma viral loads was recorded in the unvacci- 
nated animal A141, while sustained high viral loads were re- 
corded in the second unvaccinated animal, AT54. A very rapid 
decrease to undetectable levels was recorded in both vacci- 
nated animals within 35 days postchallenge. 

Concomitant with the appearance of plasma viremia in the 



vaccinated animal H445, a rapid increase (by approximately 
fivefold) of the anti-HIV envelope antibody titers was moni- 
tored (Fig. 3B). Subsequently, as the viral load in this animal 
decreased to undetectable levels, the antibody titers gradually 
decreased to prechallenge titers. In contrast, the antienvelope 
antibody titers did not increase in the second vaccinated ani- 
mal, J408, which had the lowest levels of peak plasma viremia. 
In the unvaccinated animals, anti-HIV envelope antibodies 
became detectable approximately 30 days postchallenge. Al- 
though their titers increased over time in animal A141 they 
remained weak and eventually declined prior to death in ani- 
mal AT54. 

The two unvaccinated animals seroconverted to SIV p27 ffrtS 
and pol 31 proteins within 2 weeks postchallenge, while the two 
vaccinated animals remained seronegative for the first 17 
weeks postchallenge (data not shown). Also, although virus 
was recoverable from rhesus macaque PBMCs collected from 
the unvaccinated animals at 18, 42, and 48 days postchallenge, 
it was only recoverable at day 18 from the vaccinated animals 
(data not shown). Finally, in contrast to the two vaccinated 
animals and the unvaccinated animal A141, which have re- 
mained healthy so far, the second unvaccinated animal, AT54, 
died from simian AIDS 16 weeks postchallenge. 

We believe that the anti-HIV envelope antibodies elicited by 
our vaccination, although not capable of completely eliminat- 
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FIG. 2. Depletion of CD8 + T lymphocytes. CD8 + T lymphocytes were depleted from the vaccinated animals by bolus injection of the anti-CD8 
MAb OKT8F (arrows). The numbers of circulating CD4 + (solid symbols), CD8 + T (open symbols), and total CD3 + T lymphocytes (asterisks) from 
vaccinated and unvaccinated animals was determined in samples collected at various points prior to and following SHIV162P4 challenge (dashed 
line). 



ing all the infectious SHIV162P4 viral particles introduced in 
the blood circulation during challenge, reduced their number, 
so that fewer numbers of CD4+ T lymphocytes became initially 
infected in the vaccinated than the unvaccinated animals. This 
in turn resulted in lower peak levels of plasma viremia during 
acute infection in the vaccinated animals. The observation that 



the lowest levels of peak plasma viremia were recorded in the 
vaccinated animal J408, whose serum had the strongest neu- 
tralizing activity against SHIV162P4 at the day of challenge, 
suggests that neutralizing antibodies played an important pro- 
tective role during the first 7 days postchallenge. However, in 
addition to neutralizing antibodies, envelope-specific antibod- 




FIG. 3. Viral load and generation anti-HIV envelope antibody titers following SHIV162P4 exposure. (A) Viral load expressed as number of 
RNA copies per milliliter of plasma. Dashed lines indicate the detection limit of this assay (<500 copies per ml), t, animal AT54 was euthanized 
111 days postchallenge following the development of simian AIDS. The arrow indicates the time at which CD8 + T lymphocytes reappeared in the 
periphery of the vaccinated animals. (B) The generation of anti-HIV envelope antibodies following SHIV162P4 challenge was monitored by 
SF162AV2 gpl40-based ELISA methodology. The end point ELISA titers are presented. 
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ies without neutralizing activity may have been elicited by our 
vaccine and may also have contributed to viral clearance. The 
fact that strong anamnestic anti-HIV envelope responses were 
developed in animal H445 immediately following SH1V chal- 
lenge indicates that antibodies contributed to the rapid viral 
clearance to undetectable levels. However, because the CD8^ 
lymphocytes reappeared in the periphery of the vaccinated 
animals 7 days postchallenge, they may also have contributed 
to this rapid viral clearance. 

We should point out, however, that in addition to protective 
antibody responses, our vaccination might have elicited CD4 T 
T-cell-mediated protective responses, which most likely were 
present during CD8 depletion. It would be important to assess 
whether DNA vaccination elicits such responses. 

In summary, these pilot studies highlight the important pro- 
tective role of non-CD8-mediated DNA-vaccine-induced anti- 
HIV envelope responses and strongly suggest that efforts to 
develop an effective anti-HIV vaccine should not only aim at 
eliciting strong cytotoxic T-lymphocyte responses. 
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